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PHARMACEUT IST. COMPOUNDS 

This invention relates to novel inhibitors of protein tyrosine phosphatase (PTP) 
activity, to a novel PTP form and crystal structures thereof, and to the uses of the 
5 crystal structures and novel form intermediate in the design of new drug molecules. 
The invention also provides the use of the inhibitors in medicine and in particular 
the treatment of disease states mediated by PTP activity, and to pharmaceutical 
compositions containing the compounds. 



10 Background of the Invention 

Protein tyrosine phosphatases (PTPs) are crucial in regulating signal transduction 
pathways involving tyrosine phosphorylation 1 and have been implicated in cancer, 
diabetes, rheumatoid arthritis, and hypertension 2 . 

Protein tyrosine phosphorylation plays a major role in regulation of many cell 
15 functions including response to hormones, growth factors and cytokines as well as 
in cell proliferation and apoptosis. Protein tyrosine phosphatases (PTPases) 
therefore represent an important control point in these regulatory mechanisms. 
Atypical tyrosine phosphorylation of specific proteins or components of signal 
transduction pathways has been implicated in a variety of human diseases including 
20 diabetes (diabetes type I and II) obesity, autoimmune diseases, acute and chronic 
inflammation, osteoporosis, proliferative disorders including various forms of 
cancer, growth disorders, response to infection and defective platelet aggregation. 
Such atypical tyrosine phosphorylation can result from dysregulation of both the 
kinases and or phosphatases controlling the process. In many cases PTPase activity 
25 is the major mechanism limiting the extent of phosphorylation and therefore such 
phosphatases represent key targets for therapeutic agents intended to exert 
pharmacological control over such processes. 

Type 2 diabetes is characterized by abnormalities of insulin secretion and by insulin 
resistance of the major target tissues producing a diminished uptake and 
30 metabolism of glucose. 
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Protein-tyrosine phosphatases (PTPases) play a key role in the regulation of 
reversible tyrosine phosphorylation in the insulin action pathway. The receptor for 
insulin is an integral membrane protein with tyrosine kinase activity and insulin 
signal transduction is initiated by the phosphorylation of specific tyrosyl residues 
receptor. This initiates a complex signalling cascade leading to the phosphorylation 
of several key substrates including the IRS proteins on specific tyrosine residues. 
These activation steps are balanced, in turn, by specific cellular PTPases that 
dephosphorylate and inactivate the receptor kinase and reverse the adapter function 
of the receptor substrate proteins. PTP1B is a key component of this network and 
levels of PTP1B have been reported to be increasedin diabetes associated with 
insulin resistance. Inhibition of PTP1B would therefore be expected to increase the 
strength of the signal initiated by the insulin resceptor and reverse the insulin 
resistance in such patients. 

Other cellular responses dependent on the action of tyrosine kinases are similarly 
dependent on the phosphatases which limit the strength of the response. For 
example the growth factors EOF, VEGf and PDGF all initiate a network of 
signalling cascades dependent on tyrosine phosphorylation by their specific 
receptor tyrosine kinases. The response of lymphocytes to specific antigen 
activation and of other immune cells to cytokines such as IL-6 also use non- 
, receptor tyrosine kinases as key components in their signal transduction pathways. 
Therefore inhibition of other members of the PTP family would be expected to 
control cell growth, cellular transformation, tumor formation, lymphocyte 
activation, cell migration, and inflammatory responses. 

PTP1B belongs to a large family of PTPs characterised by an 1 1 residue signature 
5 sequence (W)HCXAGXXR(S/T)G which includes the catalytic cysteine 

(Cys215) U0 . Its catalytic mechanism involves a nucleophilic attack by Cys215 on a 
phosphotyrosine substrate resulting in a covalent phosphocysteine intermediate, 
which is subsequently hydrolysed by an activated water molecule 11 . The crystal 
structure of PTP1B shows that the PTP signature motif adopts a cradle-like 
50 conformation forming the base of the active site (figure la) 12 . Its backbone amide 
atoms point to the centre of the cradle, which together with the invariant Arg221 
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provides an excellent environment to stabilise the negatively charged Cys215 side 
chain and bind the phosphate moiety of an incoming substrate 13 (figure la). A 
hydrogen bond between the hydroxyl group of Ser222 and the sulphur atom of 
Cys215 further stabilises the cysteine conformation and helps to maintain its 
5 reduced pKa (~5.4) 14 . One side of the active site is flanked by the so-called WPD- 
loop, which adopts an open conformation in the unliganded enzyme and closes over 
a bound phosphotyrosine residue 13 . The opposite side is formed by the 
phosphotyrosine (pTyr) recognition loop containing Tyr46, which mainly 
determines specificity for phosphotyrosine substrates (figure la) 15 . 

10 Increasing literature evidence suggests that the cellular redox state is involved in 
regulating PTP activity by reversibly oxidizing their catalytic cysteines. Current 
literature describes the role of the sulfenic acids (Cys-SOH) 3 " 6 . Further oxidation to 
the sulfuric (Cys-S0 2 H) and sulfonic (Cys-S0 3 H) forms causes irreversible 
inhibition. 

15 Further, there are examples of sulphur-nitrogen bonds in the literature. The reaction 
of a sulfenic acid derivative of glyceraldehyde-3-phosphate dehydrogenase with the 
small molecule, benzylamine, to form a sulfanamide has been reported by Allison et 

al 26 . 

Reich et al 27 has described studies involving the formation of cyclic selenenamide 
20 structure in a small molecule model system and suggested that in its oxidized form 
mammalian glutathione peroxidase, a selenoenzyme, may have a cyclic 
selenenamide structure. 

Summary of the Invention 

The Applicants have found that oxidation of the catalytic cysteine at the active site 
25 of PTP by oxidants leads to the formation of a sulfenyl amide moiety at the active 
site. The 'sulfenyl amide' is an isothiazolidin-3-one ring system, which has not 
been previously observed in proteins. The sulfenyl amide moiety is believed to be a 
protective intermediate in the oxidative inhibition of PTPs that prevents further 
irreversible oxidation to sulfuric and sulfonic acids. Formation of the sulfenyl 
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amide moiety at the active site leads to a loss of the enzyme's catalytic activity. 
Reduction of the sulfenyl amide moiety with a physiological reducing agent such as 
glutathione leads to regeneration of the active form of the enzyme. 

This invention is based in part on recognition that compounds that stabilize the 
5 sulfenyl amide form or effect reversible or irreversible covalent modification of the 
sulfenyl amide form will be useful as therapeutic agents. Thus, by preventing or 
inhibiting the reversion of the inactive or less active sulfenyl amide form to the 
active form of PTP, the overall level of activity of PTP within a cellular 
environment can be substantially reduced. 

10 Accordingly, the invention provides compounds that inhibit reversion of the PTP 
sulfenyl amide to the active form of PTP and their use in therapy. 

Also covered by the invention are cysteine-containing proteins which have a 
suitably nucleophilic cysteine in the active site to facilitate formation of the sulfenyl 
amide. This includes phosphatases and phosphatase-like proteins which are 
1 5 structurally homologous to the PTPs such as rhodanese and bacterial 
phosphotransferases e.g. IIBcel (also known as IIBchb). 

The term "cysteine-containing proteins" includes all proteins characterised by the 
HC(X5)R signature motif and other proteins belonging and related to this family, 
for example, those that have a remnant of this motif capable of adopting a 

20 conformation similar to the PTP phosphate binding cradle and which have a 

catalytic cysteine. A preferred set of proteins is the set in which there is an active 
site cysteine and an unusually polarised peptide bond between the active site 
cysteine and the following residue, in particular those with the HC(X5)R signature 
motif. One hypothesis is that in the HC(X5)R signature motif this bond is polarised 

25 by the conserved His in the signature motif. The HC(X5)R phosphatase family 
includes classical PTPs as well as the more distantly related low molecular weight 
(LMW) phosphatases, dual specificity phosphatases and rhodanese/CDC25 
superfamily. A preferred subset is the set of those cysteme-containing proteins that 
do not have more than one cysteine in the active site. 
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PTPs of the invention characterised by the above structural motif include all PTPs. 
In a preferred aspect of the invention this refers to PTPs without a second active site 
cysteine in close proximity of the catalytic cysteine, and more preferably those 
PTPs with one cysteine residue in the binding site. Preferred PTPs are PTPs 
5 characterised by the 1 1 residue signature sequence (I/V)HCXAGXXR(S/T)G. 
Preferred PTPs include LAR, T-cell PTP, PTP-a and PTPlb, more preferably 
PTPlb. 

The sulfenyl amide form of PTB1B has been prepared by the Applicants under 
controlled conditions and its slructure determined by X-ray diffraction analysis. 
10 The structural data can be used in methods of rational drug design to provide 

compounds that inhibit reversion of the PTP sulfenyl amide to the active form of 
PTP. 

Although the invention is specifically illustrated herein by reference to PTP1B, it is 
considered to be applicable also to other PTPs where the cellular redox state is 

15 involved in regulating PTP activity by reversibly oxidizing their catalytic cysteines 
to form sulfenyl amides. The terms "protein tyrosine phosphatase sulfenyl amide", 
"PTP sulfenyl amide" and "sulfenyl amide" used herein refer generally to PTPs in 
which a cysteine moiety at the catalytic site has been oxidized to form a sulfenyl 
amide, unless the context indicates otherwise. For the avoidance of doubt, it is 

20 noted that the terms "protein tyrosine phosphatase sulfenyl amide", "sulfenyl amide 
protein tyrosine phosphatase", "PTP sulfenyl amide" and "sulfenyl amide PTP" as 
used herein are interchangeable and refer to the same entity unless the context 
requires otherwise. 

The various aspects and embodiments of the invention are described in more detail 
25 below and defined in the claims appended hereto. 

Brief Descriptio n nf the Drawings 

Figure 1 provides a comparison of the structures of native and sulfenyl-amide 
PTP1B. 
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Figure la is a ribbon diagram of PTPIB showing the phosphate-binding cradle, the 
WPD-loop and the pTyr recognition loop. 

Figure lb shows the superimposition of the structure of native PTPIB ( light grey) 
and the sulfenyl-amide containing structure (dark grey) showing the different 
5 conformations of the pTyr recognition loop and the phosphate-binding cradle. 

Figure lc shows the electron density of the novel sulfenyl-amide derivative and its 

neighbouring residues. The electron density map in Figure lc is contoured at la. 

All figures are generated using Aesop (Martin Noble, unpublished). 

Figure 2 shows a putative mechanism of sulfenyl-amide formation and subsequent 
10 reactivation. As illustrated, the catalytic cysteine of PTPIB (E-SH) is oxidised to a 
sulfenic acid (E-S-OH). The sulfenyl-amide may be formed by a direct mechamsm 
involving a nucleophilic attack of the backbone nitrogen of Ser216 on the S, atom 
of Cys215 and subsequent release of water. Alternatively the sulfenic acid may be 
oxidised to a highly reactive intermediate by an oxidising agent e.g. by peroxide 
15 e g H 2 0 2 24 ' 25 or an oxidised thiol, which then reacts to give the sulfenyl-amide. 
Reactivation of the enzyme occurs via mixed disulfide formation with a thiol. R, 
denotes glutathione or DTT, X the leaving groups OOH (sulfenoperoxoic acid) or 
OS(0)R (sulfmothioic acid). 

Figure 3 illustrates the different oxidation states of the catalytic cysteine. The 
20 structures shown are the sulfonic (A), sulfmic (B) and sulfenic acid derivatives (C) 
of Cys215. The phosphate-binding cradle comprising residues 215 to 222 is shown 
in ball-and-stick representation. Hydrogen bonds are shown as dashed lines. The 
maps are contoured at 3a and in all maps the peaks are higher than 5a. 

Detailed Description of the Invention 
25 TVift PTP sulfe " Yl amides " f the invention 

In one aspect, the present invention contemplates an isolated PTP sulfenyl amide. 
The sulfenyl amides of the invention have a variety of uses, as described herein. 
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The terms "FTP sulfenyl amide", "sulfenyl amide PTP" and (in the context of PTP) 
"sulfenyl amide" alone are used herein as generic terms to define any PTP (as 
defined belo W )inwhichacysteine moiety at the catalytic site is oxidized to form a 

sulfenyl amide. 

The tenn "PTP" is used generally herein (and in particular in the context of the PTP 
sulfenyl amides of Ore invention) as a generic «enn <o include all members of the 
PTP protein family, whether natural, synthetic or recombinant Inferred ate PTPs 
without a second active site cystetae in close proximity to ft. catalytic cysteute, and 
more particularly preferred ate those PTPs with only one cysteine residue*, *. 
binding site. Such pteferted PTPs include LAR, T-celi PTP, PTP-a and PTP1B (tire 
lattet specificaUy exemplified herein). 

The term "PTP" is also intended to encompass PTP homologues, analogues, allelic 
forms, species variants, derivatives, muteins or equivalents, whether natural, 
synthetic or recombinant (as hereinbelow defined). 



The term "homologue" is used herein in two distinct senses. It is used sensu stricto 
,„ define proteins that share a common ancestor to the PTP. In mis sense it covers 
, orthoses (species variants which have diverged in different organisms fcllowmg 
a speciation event) and paralogues (variants which have diverge* wrttnn the same 
organism after a gene duplication even,). Thus, there is a direct evolutionary 
Kl ationship between the PTP and such homologues and tins may be reflects » 
structural ami/or functional similaritie, For example, orthologues may perform me 
, 5 same role in each organism in which they are found, while paralogues may perform 
functionally related (but distinct) roles within the same orgamsm. 
The term Is also used herein semu lata to define a PTP which is to some extent 
structurally similar (i.e. no, necessarily evolutionary related and/or arncturafiy and 
functional ly equivalent) to a given (reference) PTP (for example, to any one of 
30 LAR, T-cell PTP, PTP-a and PTP1B). In mis sense, homology is recogmsed on me 
taaia of purely statural criteria by tite presence of a*uno acid sequence .dentines 
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and/or conservative amino acid changes and/or similar secondary, tertiary or 
quaternary structures. 

In this context, a conservative amino acid substitution is one in which the amino 
acid residue is replaced with an amino acid residue having a similar side chain. 
5 Families of amino acid residues having similar side chains have been defined in the 
art (as set out for example by Dayhoff et alia, Atlas of protein structure vol. 5, 
National BioMed Fd'n, Washington D.C., 1979). These families include amino 
acids with basic side chains (e. g., lysine, arginine, histidine), acidic side chains 
(e.g. aspartic acid, glutamic acid), non-charged polar side chains (e. g. glycine, 
10 asparagine, glutamine, serine, threonine, tyrosine, cysteine), non-polar side chains 
(e. g. alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, 
tryptophan), beta-branched side chains (e. g. threonine, valine, isoleucine), and 
aromatic side chains (e. g. tyrosine, phenylalanine, tryptophan, histidine). 

15 The homologues of the invention therefore include proteins having at least 50%, 
55%, 60%, 65%, 70%, 75%, 80%, 85%, 90% or 95% sequence identity with the 
reference PTP, and include truncated forms of naturally-occurring PTP proteins. 
Such truncates are preferably at least 25%, 35%, 50% or 75% of the length of the 
corresponding wild-type PTP and may have at least 50%, 55%, 60% or 65% 

20 sequence identity (more preferably, at least 70%, 75%, 80%, 85%, 90% or 95% 
sequence identity) with that wild-type PTP. Particularly preferred homologues are 
truncates that contain a segment preferably comprising at least 8, 15, 20 or 30 
contiguous amino acids that share at least 75%, 80%, 85%, 90% or 95% sequence 
identity with that reference PTP. 

25 Preferred truncates for PTP lb are residues 1-321 or residues 1-298 of the reference 
wild-type sequence. A particularly preferred truncate for PTPlb is the one defined 
by residues 1-321 of reference PTPlb sequence accession number PI 8031 
[SwissProt: PTN1_HUMAN] . 
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For the avoidance of doubt, full length sulfenyl amide wild-type PTPs are within 
the scope of the invention as well as truncated versions of sulfenyl amide wild-type 
PTPs. 

For the purposes of the invention, homologues may also be recognised as those 
5 proteins the corresponding DNAs of which are capable of specifically or selectively 
cross-hybridising, or which can cross-hybridise under selective, appropriate and/or 
appropriately stringent hybridisation conditions. 

The term "selectively or specifically (cross)hybridisable" in this context indicates 
10 that the sequences of the corresponding ssDNAs are such that binding to a unique 

(or small class) of homologous sequences can be obtained under more or less 

stringent hybridisation conditions. Exemplary stringent conditions can be found in, 

for example, Current Protocols in Molecular Biology, John Wiley & Sons, N. Y. 

(1989), 6.3.1-6.3.6. A preferred, non-limiting example of stringent hybridization 
1 5 conditions is hybridization in 6x sodium chloride/sodium citrate (SSC) at about 

45°C, followed by one or more washes in 0.2x SSC, 0.1 SDS at a temperature of 

from about 50°C to 65°C. 

The term "allelic form" is used herein to define a naturally-occurring alternative 
20 form (allelic variant) of a wild-type PTP sequence which reflects naturally- 
occurring differences in the PTP gene pool. Allelic forms may be isolated and 
identified by making suitable probes or primers from the sequences provided herein 
and screening a suitable nucleic acid source from individuals of the appropriate 
species. 

25 The term "analogue" is used herein to define proteins with similar functions and/or 
structures and which are not necessarily evolutionarily related. PTP analogues 
which share function but which have no or little structural similarities are likely to 
have arisen by convergent evolution. Conversely, PTP analogues which share 
structural similarities but which exhibit few or no functional similarities are likely 

30 to have arisen by divergent evolution. PTP analogues may be identified, for 

example, by screening a library of proteins to detect those with similar functions) 
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but different physical properties, or by screening for proteins which share structural 
features but not necessarily any functions (e.g. by immunological screening). 

The term "species variant" is used herein to define the corresponding PTP from a 
different organism. Thus, species variants share a direct evolutionary relationship. 

5 

Species variants may be isolated and identified by making suitable probes or 
primers from the sequences provided herein and screening a suitable nucleic acid 
source from the desired species. Preferably, species variants are those isolated from 
mammalian species. Most preferably, species variants are those isolated from 

1 0 certain mammalian species such as, for example, Pan troglodytes, Gorilla gorilla, 
Pongo pygmaeus, Hylobates concolor, Macaca mulatta, Papio papio, Papio 
hamadryas, Cercopithecus aethiops, Cebus capucirius, Aotus trivirgatus, Sanguinus 
oedipus, Microcebus murinus, Mus musculus, Rattus norvegicus, Cricetulus 
griseus, Felis catus, Mustela vison, Canis familiaris, Oryctolagus cuniculus, Bos 

1 5 taurus, Ovis aries, Sus scrofa, and Equus caballus, for which genetic maps have 
been created allowing the identification of syntenic relationships between the 
genomic organization of genes in one species and the genomic organization of the 
related genes in another species (O'Brien and Seuanez, 1988, Ann. Rev. Genet. 22 : 
323-351 ; O'Brien et aL, 1993, Nature Genetics 3: 103-1 12; Johansson et aL, 1995, 

20 Genomics 25: 682-690; Lyons et aL, 1997, Nature Genetics 15: 47-56; O'Brien et 
aL, 1997, Trends in Genetics 13 (10): 393-399; Carver and Stubbs, 1997, 
GenomeResearch7 : 1 123-1 137; all of which are incorporated by reference herein). 

The term "derivative" as applied herein to the PTPs of the invention is used to 
25 define PTPs which are modified versions of any wild-type or truncated PTP. Such 
derivatives may include fusion proteins, in which the proteins of the invention have 
been fused to one or more different proteins, peptides or amino acid tags (for 
example an antibody or a protein domain conferring a biochemical activity, to act as 
a label, or to facilitate purification). Particularly preferred are derivatives in which 
30 the PTP proteins or peptides are modified by a polyHis (6xHis) tag to facilitate 
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purification of the peptide derivative on Ni 2+ agarose beads. It is further preferred 
that the proteins are derivatives of truncated PTP proteins. 

The derivatives may also be products of synthetic processes that use a wild-type 
5 PTP as a starting material or reactant. 

The term "mutein" is used herein to define PTPs that are mutant forms of a wild- 
type PTP, i.e. PTP proteins in which one or more amino acids have been added, 
altered, deleted, replaced, inserted or substituted. The muteins of the invention 
1 0 therefore include fragments, truncates and fusion peptides (e.g. comprising fused 
immunoglobulin, receptor, tag, label or enzyme moieties). 

The muteins of the invention therefore include truncated forms of a wild-type PTP. 
Such truncates are preferably least 25%, 35%, 50% or 75% of the length of the 
1 5 corresponding wild-type PTP and may have at least 65% sequence identity (more 
preferably, at least 70%, 75%, 80%, 85%, 90% or 95% sequence identity) with that 
PTP. 

The muteins of the invention also include PTPs in which mutations have been 
20 introduced which effectively promote or impair one or more activities of the PTP, 
for example mutations which promote or impair the function of the active site. 

Muteins may be produced by any convenient method. Conveniently, site-directed 
mutagenesis with mutagenic oligonucleotides may be employed using a double 
25 stranded template (pBluescript KS II construct containing a PTP gene), (e.g. 
Chameleon'M or QuikChange'M - StratageneM). After verifying each mutant 
derivative by sequencing, the mutated gene is excised and inserted into a suitable 
vector so that the modified protein can be over-expressed and purified. 

30 Preferred mutant forms are truncates consisting (or consisting essentially) of the 
PTP 11-residue signature sequence described herein. Particularly preferred are 
truncates that contain a segment preferably comprising at least 8, 15, 20 or 30 
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contiguous amino acids that share at least 75%, 80%, 85%, 90% or 95% sequence 
identity with the PTP from which they are derived by truncation. 

The term "equivalent" is used herein to define those PTP analogues which exhibit 
substantially the same function(s) and which share at least some structural features 
(e.g. functional domains), but which have not evolved from a common ancestor. 
Such equivalents are typically synthetic proteins (see below) and may be generated, 
for example, by identifying sequences of functional importance (e.g. by identifying 
conserved or canonical sequences, functional domains or by mutagenesis followed 
by functional assay), selecting an amino acid sequence on that basis and then 
) synthesising a peptide based on the selected amino acid sequence. Such synthesis 
can be achieved by any of many different methods known in the art, including solid 
phase peptide synthesis (to generate synthetic peptides) and the assembly (and 
subsequent cloning) of oligonucleotides. Some synthetic protein analogues may be 
chimaeras, and such equivalents can be designed and assembled for example by 
5 concatenation of two or more different structural and/or functional peptide domains 
from different proteins using recombinant DNA techniques. 

The homologies, analogues, fragments, muteins, equivalents or derivatives of the 
PTPs of the invention may also be defined inter alia as those proteins which cross- 
react with antibodies to one or more wild-type PTPs, and in particular those which 

20 cross-react with antibodies directed against a PTP lacking a second active site 
cysteine in close proximity of the catalytic cysteine (for example a PTP with only 
one cysteine residue in the binding site). Thus, the homologues, fragments, 
muteins, equivalents or derivatives of the PTPs of the invention include proteins 
which cross-react with antibodies to one or more of LAR, T-cell PTP, PTP-a and 

25 PTP1B. 



imize 



30 



For the purposes of the present invention, sequence identity is determined by 
comparing the amino acid sequences of the protein when aligned so as to max: 
overlap and identity while minimizing sequence gaps. In particular, sequence 
identity may be determined using any of a number of mathematical algorithms. A 
nonlimiting example of a mathematical algorithm used for comparison of two 
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sequences is the algorithm of Karlin and Altschul (1990) Proc. Natl. Acad. Sci. 
USA 87: 2264-2268, modified as in Karlin and Altschul (1993) Proc. Natl. Acad. 
Sci. USA 90: 5873-5877. 

5 Another example of a mathematical algorithm used for comparison of sequences is 
the algorithm of Myers and Miller (1988) CABIOS 4: 1 1-17. Such an algorithm is 
incorporated into the ALIGN program (version 2.0) which is part of the GCG 
sequence alignment software package. When utilizing the ALIGN program for 
comparing amino acid sequences, a PAM120 weight residue table, a gap length 

10 penalty of 12, and a gap penalty of 4 can be used. Yet another useful algorithm for 
identifying regions of local sequence similarity and alignment is the FASTA 
algorithm as described in Pearson and Lipman (1988) Proc. Natl. Acad. Sci. USA 
85: 2444-2448. 

1 5 Preferred for use according to the present invention is the WU-BL AST 

(Washington University BLAST) version 2.0 software. WU-BLAST version 2.0 
executable programs for several UNIX platforms can be downloaded from ftp 
://blast. wustl. edu/blast/executables. This program is based on WU-BLAST 
version 1 .4, which in turn is based on the public domain NCBI-BLAST version 1 .4 

20 (Altschul and Gish, 1996, Local alignment statistics, Doolittle ed., Methods in 

Enzymology 266: 460-480; Altschul et al., 1990, Basic local alignment search tool, 
Journal of Molecular Biology 215: 403-410; Gish and States, 1993, Identification of 
protein coding regions by database similarity search, Nature Genetics 3: 266-272; 
Karlin and Altschul, 1993, Applications and statistics for multiple high-scoring 

25 segments in molecular sequences, Proc. Natl. Acad. Sci. USA 90: 5873-5877; all of 
which are incorporated by reference herein). 

In all search programs in the suite the gapped alignment routines are integral to the 
database search itself. Gapping can be turned off if desired. The default penalty (Q) 
30 for a gap of length one is Q=9 for proteins and BLASTP, and Q=l 0 for BLASTN, 
but may be changed to any integer. The default per-residue penalty for extending a 
gap (R) is R=2 for proteins and BLASTP, and R=10 for BLASTN, but may be 
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changed to any integer. Any combination of values for Q and R can be used in 
order to align sequences so as to maximize overlap and identity while minimizing 
sequence gaps. The default amino acid comparison matrix is BLOSUM62, but other 
amino acid comparison matrices such as PAM can be utilized. 

5 

The term "isolated" is used herein to indicate that the PTP sulfenyl amide exists in a 
physical milieu distinct from that in which it occurs in nature. For example, the 
isolated sulfenyl amide may be substantially isolated with respect to the complex 
cellular milieu in which it naturally occurs. The absolute level of purity is not 
10 critical, and those skilled in the art can readily determine appropriate levels of 

purity according to the use to which the PTP sulfenyl amide is to be put. The term 
"isolating" when used a step in a process is to be interpreted accordingly. 

In many circumstances, the isolated PTP sulfenyl amide will form part of a 
15 composition, for example a more or less crude extract containing many other 
molecules and substances, buffer systems, matrices or excipients, which may for 
example contain other components (including assay reagents and proteins, such as 
albumin). 



20 



In other circumstances, the isolated PTP sulfenyl amide may be purified to essential 
homogeneity, for example as determined by PAGE or column chromatography (for 
example HPLC or mass spectrometry). In preferred embodiments, the isolated PTP 
sulfenyl amide is essentially the sole protein in a given composition. 

25 The isolated PTP sulfenyl amide of the invention may be crystallized. Crystals of 
the isolated PTP sulfenyl amide find particular utility in some applications of the 
invention (for example, for the in silico analyses described below). 



30 



The PTP sulfenyl amides of the invention need not be isolated in the sense defined 
above, however. For example, more or less crude preparations derived from spent 
media used to culture host cells expressing PNP or PNP sulfenyl amide may be 
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used. Such supernatants may be treated in various ways, for example by oxidation, 
concentration, filtration, centrifugation, spray drying, dialysis and/or lyopbilisation. 

The term "natural" is used herein to define a PTP that has been derived from a 
5 naturaUy-occurring wild-type protein source by chemical or enzymic treatment. 
Naturally occurring PTPs may be obtained by purification (e.g. by column 
chromatography) from cellular material in which the native PTP is expressed. 

The term "synthetic PTP" is used herein to define a PTP that has been chemically 
10 synthesised in vitro (for example by any of the commercially available solid-phase 
peptide-synthesis systems). 

The term "recombinant" is used herein to define a PTP that has been produced by 
that body of techniques collectively known as "recombinant DNA technology". 

15 

Thus, although this invention is based (at least in part) on the identification and 
characterization of a novel sulfenyl amide intermediate arising from oxidation of a 
cysteine moiety at the active site of PTP1B (see below), any of a large number of 
different PTP sulfenyl amides are contemplated by the invention and each finds 
20 general application in the various methods and processes described herein. 

TTgp. r>f the PTP sulfenvl amides o f the invention ver se in Drug Discovery 

The PTP sulfenyl amides of the invention may be used inter alia in various drug 
screening processes. 

For example, the invention provides a process for screening for a PTP inhibitor 
25 comprising the steps of: (a) providing the PTP sulfenyl amide (or a homologue, 
allelic form, species variant, derivative or mutein thereof); (b) contacting the 
sulfenyl amide of step (a) with a test compound; and (c) determining whether the 
test compound binds to the sulfenyl amide. 

The screening processes of the invention as described above are preferably high 
30 throughput processes. The screens identify and/or select compounds with PTP 
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sulfenyl amide binding activity. Such compounds are candidate PTP modulators, 
and can be subjected to further analysis and/or screening in order to determine their 
activity as therapeutic agents (see for example the section headed "Assays for 
Screening for Active Compounds", below). Alternatively, or in addition, they may 
5 be crystallized with PTP1B sulfenyl amide (e.g. by co-crystallization or by soaking) 
for X-ray analysis. The resulting X-ray structure may be compared with that of 
Table 1 or Table 2 for a variety of purposes. 

For example, the PTP sulfenyl amides of the invention may be used in a process for 
producing a PTP inhibitor comprising the steps of: (a) providing a PTP sulfenyl 
10 amide (or a homologue, allelic form, species variant, derivative or mutein thereof); 
(b) contacting the sulfenyl amide of step (a) with a test compound; (c) deteimining 
whether the test compound binds to the sulfenyl amide; and (d) identifying the test 
compound as a PTP inhibitor on the basis of its ability to prevent or inhibit the 
reductive activation of the PTP sulfenyl amide to active PTP. 

15 In such processes, at least two chemically distinct test compounds may be identified 
in step (d) and the process may then further comprise the step of linking two or 
more of the chemically distinct compounds to produce a multimeric PTP inhibitor. 
Such processes embody the linked fragment approaches described in more detail in 
the section headed "Linked fragment and fragment growing approaches", below. 

20 The PTP sulfenyl amides of the invention may therefore be used in a process for 
producing a pharmaceutical composition comprising the steps of: (a) providing a 
PTP sulfenyl amide (or a homologue, allelic form, species variant, derivative or 
mutein thereof); (b) contacting the sulfenyl amide of step (a) with a test compound; 
(c) determining whether the test compound binds to the sulfenyl amide; (d) 
25 identifying the test compound as a PTP inhibitor on the basis of its ability to 

prevent or inhibit the reductive activation of the PTP sulfenyl amide to active PTP; 
and (e) incorporating the inhibitor identified in step (d) into a pharmaceutical 
excipient. 
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The invention contemplates PTP inhibitors, drugs and pharmaceutical 
compositions obtainable by, or obtained by, the process of the invention described 
above. 




This invention is based on the identification and characterization of a novel sulfenyl 
amide intermediate arising from oxidation of a cysteine moiety at the active site of a 
protein tyrosine phosphatase. 

The catalytic domain (residues 1-321) of PTP1B was expressed in E. coli cells 
10 according to known procedures and was purified and crystallized. The oxidation 
state of the catalytic cysteine of PTP1B was probed by means of soaking 
experiments using various oxidizing agents and crystal structures were 
subsequently obtained for a novel sulfenyl-amide intermediate of PTP1B, as well as 
sulfenic, sulfinic and sulfonic PTP IB derivatives. 

15 Soaking crystals of the catalytic domain of PTB1B with 2-phenyl-isoxazolidine- 
3,5-dione gave rise to a modified crystal structure (the PTP1B sulfenyl amide), the 
stocture for which has been determined by X-ray diffraction analysis. Atomic 
coordinates of the catalytic domain are set out in Table 1 and Table 2. 
Accordingly, in one aspect, the invention provides a crystal of sulfenyl amide 

20 protein tyrosine phosphatase IB . 
Crystals 

In another aspect, the invention provides a crystal of sulfenyl amide protein tyrosine 
phosphatase IB having a Unit cell dimensions: a = 87.686 A, b = 87.686 A, c - 
103.721 A, a = 90.00°, p = 90.00°, y = 120.00° and a space group: P3, 2 1. Unit 
25 cell variability of 5% may be observed in all dimensions. 

The invention also provides a crystal of sulfenyl amide protein tyrosine 
phosphatase IB having a resolution better than, i.e. numerically lower than, 3.0 A, 
preferably lower than 2.6 A. 
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The invention also provides crystals of sulfenyl amide protein tyrosine phosphatase 
IB capable of being soaked with compound(s) to form co-complex structures. 

Table 1 coordinates 

The invention also provides a crystal of sulfenyl amide protein tyrosine phosphatase 

5 IB having the structure defined by the coordinates of Table 1 . 

Table 1 gives atomic coordinate data for sulfenyl amide protein tyrosine 
phosphatase IB. In Table 1 the third column denotes the atom type, the fourth the 
residue type, the fifth the chain identification, the sixth the residue number. The 
seventh, eighth and ninth columns are the X, Y, Z coordinates respectively of the 

10 atom in question, the tenth column defines the occupancy of the atom, the eleventh 
column gives the temperature factor of the atom. 

Table 2 coordinates 

The invention further provides a crystal of sulfenyl amide protein tyrosine 
phosphatase IB having the structure defined by the coordinates of Table 2. 

15 The coordinates of Table 2 are defined as the coordinates of Table 1 as amended in 
the manner outlined in Table 2. Table 2 varies in six ways from Table 1 . These 
changes do not affect the positioning of the atoms of sulfenyl amide protein 
tyrosine phosphatase IB. The co-ordinates of Table 2 represent the same spatial 
distribution of atoms of sulfenyl amide protein tyrosine phosphatase IB as 

20 contained in Table 1 but in a format consistent with that of the EBI Macromolecular 
Structure Database (Hinxton, UK). Thus, the invention covers all co-ordinate files 
that essentially represent the same spatial distribution of sulfenyl amide protein 
tyrosine phosphatase IB atoms independent of file format. 

The coordinates of Table 1 or Table 2 provide a measure of atomic location in 
25 Angstroms, to a third decimal place. The coordinates are a relative set of positions 
that define a shape in three dimensions, so it is possible that an entirely different set 
of coordinates having a different origin and/or axes could define a similar or 
identical shape. Furthermore, varying the relative atomic positions of the atoms of 
the structure so that the root mean square deviation of the residue backbone atoms 
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(i.e. the nitrogen-carbon-carbon backbone atoms of the protein amino acid residues) 
or the C-alpha atoms is less than 1 .5 A (preferably less than 1 .0 A, more preferably 
less than 0.5 A and even more preferably less than 0.47 A) when superimposed on 
the coordinates provided in Table 1 or Table 2 for the residue backbone atoms, will 
5 generally result in a structure which is substantially the same as the structure of 
Table 1 or Table 2 in terms of both its structural characteristics and potency for 
structure-based design of PTP1B inhibitors. 

Likewise changing the number and/or positions of the water molecules and/or 
substrate molecules of Table 1 or Table 2 will not generally affect the potency of 

10 the structure for structure-based design of PTP1B inhibitors. Thus for the purposes 
described herein as being aspects of the present invention, it is within the scope of 
the invention if: the Table 1 or Table 2 coordinates are transposed to a different 
origin and/or axes; the relative atomic positions of the atoms of the structure are 
varied so that the root mean square deviation of residue backbone atoms or the C- 

1 5 alpha atoms is less than 1 .5 A (preferably less than 1 .0 A , more preferably less than 
0.5 A and even more preferably less than 0.47 A) when superimposed on the 
coordinates provided in Table 1 or Table 2 for the residue backbone atoms; and/or 
the number and/or positions of water molecules and/or substrate molecules is 
varied. References herein to the coordinate data of Table 1 or Table 2 thus include 

20 the coordinate data in which one or more individual values of the Tables are varied 
in this way. By "root mean square deviation" we mean the square root of the 
arithmetic mean of the squares of the deviations from the mean. 

It is also within the scope of the invention if the coordinate file represents the same 
spatial distribution of sulfenyl amide protein tyrosine phosphatase IB atoms but in a 

25 different file format Alternative file formats (e.g. such as a format consistent with 
that of the EBI Macromolecular Structure Database (Hinxton, UK)) which may 
include a different ordering of these atoms, or a different designation of the residues 
or residue molecule atoms, may be used or preferred by others of skill in the art. 
However it will be apparent that the use of a different file format to present or 

30 manipulate the coordinates of the Tables is within the scope of the present 

invention. Thus for the purposes described herein as being aspects of the present 
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invention, it is within the scope of the invention if the coordinates are essentially 
the same as Table 1 or Table 2, essentially comprise the coordinates of Table 1 or 
Table 2, or are a set of coordinates that materially correspond to those of Table 1 or 
Table 2. 

5 Other crystals of the invention include crystals which have selected coordinates of 
the binding pocket, wherein the amino acid residues associated with those selected 
coordinates are located in a protein framework which holds these amino acids in a 
relative spatial configuration corresponding to the spatial configuration of those 
amino acids in Table 1 or Table 2. By "corresponding to", it is meant within an 

10 r.m.s.d. of less than 2.0 A, preferably less than 1 .5 A, more preferably less than 1 .0 
A, even more preferably less than 0.5A and most preferably less than 0.47 A. In a 
further embodiment it is it within an r.m.s.d. of less than 0.3 A, less than 0.25 A, or 
less than 0.2 A, and most preferably less than 0.1 A. The amino acids which 
provide the selected coordinates are preferably selected from amino acids which 

15 form part of at least one sulfenyl amide protein tyrosine phosphatase IB binding 
cavity, where these are residues 1 to 56 as described herein or combinations thereof 
as defined further herein below. 

Those of skill in the art will appreciate that in many applications of the invention, it 
is not necessary to utilise all the coordinates of Table 1 or Table 2, but merely a 

20 portion of them. For example, as described below, in methods of modelling 
candidate compounds with PTP sulfenyl amide, selected coordinates of PTP 
sulfenyl amide may be used, for example at least 5, preferably at least 10, more 
preferably at least 20 and even more preferably at least 100 atoms of the sulfenyl 
amide structure. Likewise, the other applications of the invention described herein, 

25 including homology modelling and structure solution, and data storage and 

computer assisted manipulation of the coordinates, may also utilise all or a portion 
of the coordinates of Table 1 or Table 2, A preferred aspect of the invention is 
where the portion of the coordinates relates to the selected coordinates of the 
binding pocket. The amino acids which provide the selected coordinates are 

30 preferably selected from amino acids which form part of at least one sulfenyl amide 
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protein tyrosine phosphatase IB binding cavity, where these are residues 1 to 56 as 
described herein or combinations thereof as defined further herein below. 

It will also be appreciated that the invention also includes within its scope 
crystals of PTP sulfenyl amide comprising amino acids having the atomic 

5 coordinates of Tables 1 or 2, but wherein the crystal comprises further amino acids 
in addition to those for which the coordinates are given. Therefore, unless 
explicitly set out to the contrary, or otherwise made clear from the context, 
references throughout the present specification to the use of all or selected 
coordinates of or from Tables 1 or 2 does not exclude the use of additional 

10 coordinates due to the presence of further amino acids. 

Comparison of protein structures 

Protein structure similarity is routinely expressed and measured by the root mean 
square deviation (r.m.s.d.), which measures the difference in positioning in space 
between two sets of atoms. The r.m.s.d. measures distance between equivalent 

1 5 atoms after their optimal superposition. The r.m.s.d. can be calculated over all 
atoms, over residue backbone atoms (i.e. the nitrogen-carbon-carbon backbone 
atoms of the protein amino acid residues), main chain atoms only (i.e. the nitrogen- 
carbon-oxygen-carbon backbone atoms of the protein amino acid residues), side 
chain atoms only or more usually over C-alpha atoms only. For the purposes of this 

20 invention, the r.m.s.d. can be calculated over any of these, using any of the methods 
outlined below. 

Methods of comparing protein structures are discussed in Methods of Enzymology, 
vol. 115, pg 397-420. The necessary least-squares algebra to calculate r.m.s.d. has 
been given by Rossman and Argos (J. Biol. Chem. , vol. 250, pp7525 (1975)) 

25 although faster methods have been described by Kabsch (Acta Crystallogr., Section 
A, A92, 922 (1976); Acta Cryst A34, 827-828 (1978)), Hendrickson (Acta 
Crystallogr., Section A, A35, 158 (1979)); McLachan (J. Mol. Biol., vol 128, pp49 
(1979) and Kearsley (Acta Crystallogr., Section A, A45, 208 (1989)). Some 
algorithms use an iterative procedure in which the one molecule is moved relative 

30 to the other, such as that described by Ferro and Hermans (Ferro and Hermans, Acta 
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Crystallographies A33, 345-347 (1977)). Other methods, e.g. Kabsch's algorithm, 
locate the best fit directly. 

Programs for determining r.m.s.d include MNYFIT (part of a collection of 
programs called COMPOSER, SutclifFe, M.J., Haneef, I., Carney, D. and Blundell, 
5 T.L. (1987) Protein Engineering, 1, 377-384), MAPS (Lu, G. An Approach for 
Multiple Alignment of Protein Structures (1998, in manuscript and on 
http://bioinfol.mbfys.lu.se/TOP/maps.html)). 

It is usual to consider C-alpha atoms and the r.m.s.d. can then be calculated using 
programs such as LSQKAB (Collaborative Computational Project 4. The CCP4 
10 Suite: Programs for Protein Crystallography, Acta Ciystallographica, D50, (1994), 
760-763), QUANTA (commercially available from Accelrys, San Diego, CA), 
Insight (commercially available from Accelrys, San Diego, CA), Sybyl® 
(commercially available from Tripos, Inc., St Louis), O (Jones et al., Acta 
Crystallographica, A47, (1991), 1 10-1 19), and other coordinate fitting programs. 

1 5 In, for example, the programs LSQKAB and O, the user can define the residues in 
the two proteins that are to be paired for the purpose of the calculation. 
Alternatively, the pairing of residues can be determined by generating a sequence 
alignment of the two proteins, programs for sequence alignment are discussed in 
more detail above. The atomic coordinates can then be superimposed according to 

20 this alignment and an r.m.s.d. value calculated. The program Sequoia (CM. Bruns, 
I. Hubatsch, M. RidderstrSm, B. Mannervik, and J. A. Tainer (1999) Human 
Glutathione Transferase A4-4 Crystal Structures and Mutagenesis Reveal the Basis 
of High Catalytic Efficiency with Toxic Lipid Peroxidation Products, Journal of 
Molecular Biology 288(3): 427-439) performs the alignment of homologous protein 

25 sequences, and the superposition of homologous protein atomic coordinates. Once 
aligned, the r.m.s.d. can be calculated using programs detailed above. For 
sequences identical, or highly identical, the structural alignment of proteins can be 
done manually or automatically as outlined above. Another approach would be to 
generate a superposition of protein atomic coordinates without considering the 

30 sequence. 
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It is more normal when comparing significantly different sets of coordinates to 
calculate the r.m.s.d. value over C-alpha atoms only. It is particularly useful when 
analysing side chain movement to calculate the r.m.s.d. over all atoms and this can 
be done using LSQKAB and other programs. 

5 Mutants 

Also, modifications in the sulfenyl amide protein tyrosine phosphatase IB crystal 
structure due to e.g. mutations, additions, substitutions, and/or deletions of amino 
acid residues could account for variations in the atomic coordinates. However, 
atomic coordinate data of sulfenyl amide protein tyrosine phosphatase IB modified 

10 so that a ligand that bound to one or more binding sites of sulfenyl amide protein 
tyrosine phosphatase IB would be expected to bind to the corresponding binding 
sites of the modified sulfenyl amide protein tyrosine phosphatase IB are, for the 
purposes described herein as being aspects of the present invention, also within the 
scope of the invention. References herein to the coordinates of Table 1 or Table 2 

15 thus include the coordinates modified in this way. Preferably, the modified 

coordinate data define at least one sulfenyl amide protein tyrosine phosphatase IB 
binding cavity. 

Crystals of the invention also include crystals of sulfenyl amide protein tyrosine 
phosphatase IB mutants. In addition, sulfenyl amide protein tyrosine phosphatase 
20 IB mutants may be crystallized in co-complex with known sulfenyl amide protein 
tyrosine phosphatase IB substrates or inhibitors or novel compounds. 

As explained herein, a mutant sulfenyl amide protein tyrosine phosphatase IB is a 
sulfenyl amide protein tyrosine phosphatase IB protein characterized by the 
25 replacement or deletion of at least one amino acid from the wild type PTP1B. Such 
a mutant may be prepared for example by site-specific mutagenesis, or 
incorporation of natural or unnatural amino acids. 

As explained herein, the present invention therefore contemplates sulfenyl amide 
30 protein tyrosine phosphatase IB mutants as hereinbefore defined. 
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For example, the sulfenyl amide protein tyrosine phosphatase IB mutants may 
define a polypeptide which is obtained by replacing at least one amino acid residue 
in a native or synthetic sulfenyl amide protein tyrosine phosphatase IB with a 
different amino acid residue and/or by adding and/or deleting amino acid residues 
within the native polypeptide or at the N- and/or C-terminus of a polypeptide 
corresponding to sulfenyl amide protein tyrosine phosphatase IB, and which has 
substantially the same three-dimensional structure as sulfenyl amide protein 
tyrosine phosphatase IB from which it is derived. By having substantially the same 
three-dimensional structure is meant having a set of atomic structure co-ordinates 
that have a root mean square deviation (r.m.s.d.) of less than or equal to about 1.5 
A, preferably less than 0.47 A, when superimposed with the atomic structure co- 
ordinates of the sulfenyl amide protein tyrosine phosphatase IB from which the 
mutant is derived when at least about 50% to 100% of the C a atoms of the sulfenyl 
amide protein tyrosine phosphatase IB are included in the superposition. A mutant 
may have, but need not have, enzymatic or catalytic activity. 

To produce homologues or mutants, amino acids present in the said protein can be 
replaced by other amino acids having similar properties, for example 
hydrophobicity, hydrophobic moment, antigenicity, propensity to form or break a- 
helical or (3-sheet structures, and so. Substitutional variants of a protein are those in 
which at least one amino acid in the protein sequence has been removed and a 
different residue inserted in its place. Amino acid substitutions are typically of 
single residues but may be clustered depending on functional constraints e.g. at a 
crystal contact. Preferably amino acid substitutions will comprise conservative 
amino acid substitutions. Insertional amino acid variants are those in which one or 
more amino acids are introduced. This can be amino-terminal and/or carboxy- 
terminal fusion as well as intrasequence. Examples of amino-terminal and/or 
carboxy-terminal fusions are affinity tags, MBP tag, and epitope tags. 

Amino acid substitutions, deletions and additions that do not significantly interfere 
with the three-dimensional structure of the sulfenyl amide protein tyrosine 
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phosphatase IB will depend, in part, on the region of the sulfenyl amide protein 
tyrosine phosphatase IB where the substitution, addition or deletion occurs. In 
highly variable regions of the molecule, non-conservative substitutions as well as 
conservative substitutions may be tolerated without significantly disrupting the 
5 three-dimensional structure of the molecule. In highly conserved regions, or 
regions containing significant secondary structure, conservative amino acid 
substitutions are preferred. 

As explained earlier, conservative amino acid substitutions are well known in the 
10 art, and include substitutions made on the basis of similarity in polarity, charge, 
solubility, hydrophobicity, hydrophilicity and/or the amphipathic nature of the 
amino acid residues involved. For example, negatively charged amino acids 
include aspartic acid and glutamic acid; positively charged amino acids include 
lysine and arginine; amino acids with uncharged polar head groups having similar 
15 hydrophilicity values include the following: leucine, isoleucine, valine; glycine, 
alanine; asparagine, glutamine; serine, threonine; phenylalanine, tyrosine. Other 
conservative amino acid substitutions are well known in the art. 

In some instances, it may be particularly advantageous or convenient to substitute, 
20 delete and/or add amino acid residues to a sulfenyl amide protein tyrosine 

phosphatase IB binding pocket or catalytic residue in order to provide convenient 
cloning sites in the cDNA encoding the polypeptide, to aid in purification of the 
polypeptide, to modify compound binding etc. Such substitutions, deletions and/or 
additions which do not substantially alter the three dimensional structure of sulfenyl 
25 amide protein tyrosine phosphatase IB will be apparent to those having skills in the 
art. 

It should be noted that the mutants contemplated herein need not exhibit enzymatic 
activity. Indeed, amino acid substitutions, additions or deletions that interfere with 
30 the catalytic activity of the protein tyrosine phosphatase IB but which do not 
significantly alter the three-dimensional structure of the catalytic region are 
specifically contemplated by the invention. Such crystalline polypeptides, or the 
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atomic structure co-ordinates obtained there from, can be used to identify 
compounds that bind to the protein. 

The crystallization of such mutants and the determination of the three-dimensional 
5 structures by X-ray crystallography rely on the ability of the mutant proteins to 

yield crystals that diffract at high resolution. The mutant protein could then be used 
to obtain information on compound binding through the determination of mutant 
protein/ligand complex structures, which may be characterized using the sulfenyl 
amide protein tyrosine phosphatase IB crystal structure of Table 1 or Table 2. 

10 

The mutations can be introduced by site-directed mutagenesis e.g. using a 
Stratagene QuikChange™ Site-Directed Mutagenesis Kit or cassette mutagenesis 
methods (see e.g. Ausubel et al., eds., Current Protocols in Molecular Biology, 
John Wiley & Sons, Inc., New York, and Sambrook et al., Molecular Cloning: a 
15 Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, (1989)). 

To the extent that the present invention relates to sulfenyl amide protein tyrosine 
phosphatase IB -ligand complexes and mutant and homologue proteins of sulfenyl 
20 amide protein tyrosine phosphatase IB, crystals of such proteins may be formed. 
The skilled person would recognize that the conditions provided herein for 
crystallizing sulfenyl amide protein tyrosine phosphatase IB may be used to form 
such crystals. Alternatively, the skilled person would use the conditions as a basis 
for identifying modified conditions for forming the crystals. 

25 

Thus the aspects of the invention relating to crystals of sulfenyl amide protein 
tyrosine phosphatase IB, may be extended to crystals of a mutant, analogue or 
homologue, or mutein 



30 



Homology Modelling 

The invention also provides a means for homology modelling of other proteins 
(referred to below as target sulfenyl amide protein tyrosine phosphatase proteins). 
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By "homology modelling", it is meant the prediction of related sulfenyl amide 
protein tyrosine phosphatase structures based either on X-ray crystallographic data 
or computer-assisted de novo prediction of structure, based upon manipulation of 
the coordinate data of Table 1 or Table 2. 

5 

"Homology modeling" extends to target sulfenyl amide protein tyrosine 
phosphatase proteins, which axe analogues or homologues of the sulfenyl amide 
protein tyrosine phosphatase IB protein whose structure has been determined in the 
accompanying examples. 

10 

The term "homologous regions" describes amino acid residues in two sequences 
that are identical or have similar (e.g. aliphatic, aromatic, polar, negatively charged, 
or positively charged) side-chain chemical groups. Identical and similar residues in 
homologous regions are sometimes described as being respectively "invariant" and 
1 5 "conserved" by those skilled in the art. 

In general, the method involves comparing the amino acid sequences of the sulfenyl 
amide protein tyrosine phosphatase IB protein of Table 1 or Table 2 with a target 
sulfenyl amide protein tyrosine phosphatase protein by aligning the amino acid 
20 sequences (Dunbrack et al., Folding and Design, 2, (1997), 27-42). Amino acids in 
the sequences are then compared and groups of amino acids that are homologous 
(conveniently referred to as "corresponding regions") are grouped together. This 
method detects conserved regions of the polypeptides and accounts for amino acid 
insertions or deletions. 

25 

Homology between amino acid sequences can be determined using commercially 
available algorithms. The programs BLAST, GAPPED BLAST, BLASTN, PSI- 
BLASTAND BLAST 2 sequences (provided by the National Center for 
Biotechnology Information) are widely used in the art for this purpose, and can 
30 align homologous regions of two amino acid sequences. These may be used with 
default parameters to determine the degree of homology between the amino acid 
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sequence of the Table 1 or Table 2 protein and other target sulfenyl amide protein 
tyrosine phosphatase proteins, which are to be modeled. 

Analogues are defined as proteins with similar three-dimensional structures and/or 
5 functions with little evidence of a common ancestor at a sequence level. 

Homologues are defined as previously as proteins with evidence of a common 
ancestor, i.e. likely to be the result of evolutionary divergence and are divided into 
remote, medium and close sub-divisions based on the degree (usually expressed as 
10 a percentage) of sequence identity. 

A homologue is defined here as a protein with at least 15% sequence identity or 
which has at least one functional domain, which is characteristic of sulfenyl amide 
protein tyrosine phosphatase IB. 

15 

There are two types of homologue: orthologues and paralogues. Orthologues are 
defined as homologous genes in different organisms, i.e. the genes share a common 
ancestor coincident with the speciation event that generated them. Paralogues are 
defined as homologous genes in the same organism derived from a 
20 gene/chromosome/ genome duplication, i.e. the common ancestor of the genes 
occurred since the last speciation event. 

The homologues could also be mutants as described above. 

25 Once the amino acid sequences of the polypeptides with known and unknown 
structures are aligned, the structures of the conserved amino acids in a computer 
representation of the polypeptide with known structure are transferred to the 
corresponding amino acids of the polypeptide whose structure is unknown. For 
example, a tyrosine in the amino acid sequence of known structure may be replaced 

30 by a phenylalanine, the corresponding homologous amino acid in the amino acid 
sequence of unknown structure. 
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The structures of amino acids located in non-conserved regions may be assigned 
manually by using standard peptide geometries or by molecular simulation 
techniques, such as molecular dynamics. The final step in the process is 
accomplished by refining the entire structure vising molecular dynamics and/or 
5 energy minimization. 

Homology modeling as such is a technique that is well known to those skilled in the 
art (see e.g. Greer, Science, vol. 228, (1985), 1055, and Blundell etal, Eur. J. 
Biochern, vol. 172, (1988), 513). The techniques described in these references, as 
10 well as other homology modeling techniques, generally available in the art, may be 
used in performing the present invention. 

Thus the invention provides a method of homology modeling comprising the steps 
of: (a) aligning a representation of an amino acid sequence of a target sulfenyl 

1 5 amide protein tyrosine phosphatase protein of unknown three-dimensional structure 
with the amino acid sequence of the sulfenyl amide protein tyrosine phosphatase IB 
of Table 1 or Table 2 to match homologous regions of the amino acid sequences; 
(b) modeling the structure of the matched homologous regions of said target 
sulfenyl amide protein tyrosine phosphatase of unknown structure on the 

20 corresponding regions of the sulfenyl amide protein tyrosine phosphatase IB 

structure as defined by Table 1 or Table 2; and (c) determining a conformation (e.g. 
so that favorable interactions are formed within the target sulfenyl amide protein 
tyrosine phosphatase of unknown structure and/or so that a low energy 
conformation is formed) for said target sulfenyl amide protein tyrosine phosphatase 

25 of unknown structure which substantially preserves the structure of said matched 
homologous regions. 

Preferably one or all of steps (a) to (c) are performed by computer modeling. 

30 The aspects of the invention described herein which utilize the sulfenyl amide 
protein tyrosine phosphatase IB structure in silico may be equally applied to 
homologue models of sulfenyl amide protein tyrosine phosphatase obtained by the 
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above aspect of the invention, and this application forms a further aspect of the 
present invention. Thus having determined a conformation of a sulfenyl amide 
protein tyrosine phosphatase by the method described above, such a conformation 
may be used in a computer-based method of rational drug design as described 
5 herein. 

In a preferred aspect of this invention the co-ordinates are used to model the 
structure of target sulfenyl amide protein tyrosine phosphatases, particularly 
homologues of sulfenyl amide protein tyrosine phosphatase IB, for example PTP-a, 
10 T-cell PTP, or LAR. 

Structure solution 

The structure of the human sulfenyl amide protein tyrosine phosphatase IB can also 
be used to solve the crystal structure of other target sulfenyl amide protein tyrosine 
15 phosphatase proteins including other crystal forms of sulfenyl amide protein 
tyrosine phosphatase IB, mutants, and co-complexes of sulfenyl amide protein 
tyrosine phosphatase IB, where X-ray diffraction data or NMR spectroscopic data 
of these target sulfenyl amide protein tyrosine phosphatase proteins have been 
generated and require interpretation in order to provide a structure. 

20 

In the case of sulfenyl amide protein tyrosine phosphatase IB, this protein may 
crystallize in more than one crystal form. The structure coordinates of sulfenyl 
amide protein tyrosine phosphatase IB, or portions thereof, as provided by this 
invention are particularly useful to solve the structure of those other crystal forms 
25 of sulfenyl amide protein tyrosine phosphatase IB. They may also be used to solve 
the structure of sulfenyl amide protein tyrosine phosphatase IB mutants, sulfenyl 
amide protein tyrosine phosphatase IB co-complexes, or the structure of the 
crystalline form of any other protein with significant amino acid sequence 
homology to sulfenyl amide protein tyrosine phosphatase IB. 

30 

In the case of other target sulfenyl amide protein tyrosine phosphatase proteins, 
particularly the mutant sulfenyl amide protein tyrosine phosphatase proteins 
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referred to above, the present invention allows the structures of such targets to be 
obtained more readily where raw X-ray diffraction data are generated. 

Thus, where X-ray crystallographic or NMR spectroscopic data are provided for a 
5 target sulfenyl amide protein tyrosine phosphatase lB-ligand complex, or a sulfenyl 
amide protein tyrosine phosphatase IB homologue or analogue of unknown three- 
dimensional structure, the structure of sulfenyl amide protein tyrosine phosphatase 
IB, as defined by Table 1 or Table 2, may be used to interpret the data to provide a 
likely structure for the other sulfenyl amide protein tyrosine phosphatases by 
1 0 techniques which are well known in the art, e.g. phasing in the case of X-ray 
crystallography and assisting peak assignments in NMR spectra. 

One method that may be employed for these purposes is molecular replacement. In 
this method, the unknown crystal structure, whether it is another crystal form of 

1 5 sulfenyl amide protein tyrosine phosphatase IB, a sulfenyl amide protein tyrosine 
phosphatase IB mutant, or a sulfenyl amide protein tyrosine phosphatase IB co- 
complex, or the crystal of a target sulfenyl amide protein tyrosine phosphatase 
protein with amino acid sequence homology to protein tyrosine phosphatase IB, 
may be determined using the sulfenyl amide protein tyrosine phosphatase IB 

20 structure coordinates of this invention as provided herein. This method will provide 
an accurate structural form for the unknown crystal more quickly and efficiently 
than attempting to determine such information ab initio. 

Examples of computer programs known in the art for performing molecular 
25 replacement are CNX (Brunger A.T.; Adams P.D.; Rice L.M., Current Opinion in 
Structural Biology, Volume 8, Issue 5, October 1998, Pages 606-611 (also 
commercially available from Accelrys San Diego, CA) or Amore (Navaza, J. 
(1994). Amore: An Automated Package for Molecular Replacement. Acta Cryst. 
A50, 157-163). 

30 

Thus, in a further aspect of the invention provides a method for determining the 
structure of a protein, which method comprises providing the co-ordinates of Table 
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1 or Table 2, and either (a) positioning the co-ordinates in the crystal unit cell of 
said protein so as to provide a structure for said protein or (b) assigning NMR 
spectra peaks of said protein by manipulating the coordinates of Table 1 or Table 2. 

5 In a preferred aspect of this invention the co-ordinates are used to solve the 
structure of target sulfenyl amide protein tyrosine phosphatase, particularly 
homologues of sulfenyl amide protein tyrosine phosphatase IB, for example PTP-a, 
T-cell PTP, or LAR. 

Computer systems 

10 In another aspect, the present invention provides systems, particularly a computer 
system, the systems containing either (a) atomic coordinate data according to Table 
1 or Table 2, said data defining the three-dimensional structure of sulfenyl amide 
protein tyrosine phosphatase IB or at least selected coordinates thereof; (b) 
structure factor data (where a structure factor comprises the amplitude and phase of 

15 the diffracted wave) for sulfenyl amide protein tyrosine phosphatase IB, said 

structure factor data being derivable from the atomic coordinate data of Table 1 or 
Table 2; (c) atomic coordinate data of a target sulfenyl amide protein tyrosine 
phosphatase protein generated by homology of the target based on the data of 
Table 1 or Table 2; (d) atomic coordinate data of a target sulfenyl amide protein 

20 tyrosine phosphatase protein generated by interpreting X-ray crystallographic data 
or NMR data by reference to the data of Table 1 or Table 2; or (e) structure factor 
data derivable from the atomic coordinate data of (c) or (d). 

For example the computer system may comprise: (i) a computer-readable data 
25 storage medium comprising data storage material encoded with the computer- 
readable data; (ii) a working memory for storing instructions for processing said 
computer-readable data; and (iii) a central-processing unit coupled to said working 
memory and to said computer-readable data storage medium for processing said 
computer-readable data and thereby generating structures and/or performing 
30 rational drug design. The computer system may further comprise a display coupled 
to said central-processing unit for displaying said structures. 
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The invention also provides such systems containing atomic coordinate data of 
target sulfenyl amide protein tyrosine phosphatase proteins wherein such data have 
been generated according to the methods of the invention described herein based on 
5 the starting data provided by Table 1 or Table 2. 

Such data are useful for a number of purposes, including the generation of 
structures to analyze the mechanisms of action of sulfenyl amide protein tyrosine 
phosphatase IB proteins and/or to perform rational drug design of compounds 
10 which interact with sulfenyl amide protein tyrosine phosphatase IB, such as 

compounds which are inhibitors of sulfenyl amide protein tyrosine phosphatase IB. 

In another aspect, the invention provides a computer-readable storage medium, 
comprising a data storage material encoded with computer readable data, wherein 

15 the data are defined by all or a portion (e.g. selected coordinates as defined herein) 
of the structure coordinates of sulfenyl amide protein tyrosine phosphatase IB of 
Table 1 or Table 2, or a homologue of sulfenyl amide protein tyrosine phosphatase 
IB, wherein said homologue comprises backbone atoms that have a root mean 
square deviation from the backbone atoms (nitrogen-carbona-carbon) of Table 1 or 

20 Table 2 of not more than 1 .5 A. 

The invention also provides a computer-readable data storage medium comprising a 
data storage material encoded with a first set of computer-readable data comprising 
a Fourier Transform of at least a portion (e.g. selected coordinates as defined 

25 herein) of the structural coordinates for sulfenyl amide protein tyrosine phosphatase 
IB according to Table 1 or Table 2; which, when combined with a second set of 
machine readable data comprising an X-ray diffraction pattern of a molecule or 
molecular complex of unknown structure, using a machine programmed with the 
instructions for using said first set of data and said second set of data, can determine 

30 at least a portion of the structure coordinates corresponding to the second set of 
machine readable data. 
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In a further aspect, the present invention provides computer readable media with at 
least one of: (a) atomic coordinate data according to Table 1 or Table 2 recorded 
thereon, said data defining the three-dimensional structure of sulfenyl amide protein 
tyrosine phosphatase IB, or at least selected coordinates thereof; (b) structure factor 
5 data for sulfenyl amide protein tyrosine phosphatase IB recorded thereon, the 

structure factor data being derivable from the atomic coordinate data of Table 1 or 
Table 2; (c) atomic coordinate data of a target sulfenyl amide protein tyrosine 
phosphatase protein generated by homology modeling of the target based on the 
data of Table 1 or Table 2; (d) atomic coordinate data of a sulfenyl amide protein 
10 tyrosine phosphatase lB-ligand complex or a sulfenyl amide protein tyrosine 
phosphatase IB homologue or analogue generated by interpreting X-ray 
crystallographic data or NMR data by reference to the data of Table 1 or Table 2; 
and (e) structure factor data derivable from the atomic coordinate data of (c) or (d). 

15 By providing such computer readable media, the atomic coordinate data can be 
routinely accessed to model sulfenyl amide protein tyrosine phosphatase IB or 
selected coordinates thereof. For example, Rasmol (Sayle et al., TIBS, vol. 20, 
(1995), 374) is a publicly available computer software package which allows access 
and analysis of atomic coordinate data for structure determination and/or rational 

20 drug design. 

On the other hand, structure factor data, which are derivable from atomic coordinate 
data (see e.g. Blundell et al., in Protein Crystallography, Academic Press, New 
York, London and San Francisco, (1976)), are particularly useful for calculating 
25 e.g. difference Fourier electron density maps. 

A further aspect of the invention provides a method of providing data for generating 
structures and/or performing rational drug design for sulfenyl amide protein 
tyrosine phosphatase IB, sulfenyl amide protein tyrosine phosphatase IB 
30 homologues or analogues, complexes of sulfenyl amide protein tyrosine 

phosphatase IB with a candidate modulator, or complexes of sulfenyl amide protein 
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tyrosine phosphatase IB homologues or analogues with candidate modulators, the 
method comprising: 

(i) establishing communication with a remote device containing computer-readable 
data comprising at least one of: (a) atomic coordinate data according to Table 1 or 

5 Table 2, said data defining the three-dimensional structure of sulfenyl amide protein 
tyrosine phosphatase IB, at least one sub-domain of the three-dimensional structure 
of sulfenyl amide protein tyrosine phosphatase IB, or the coordinates of a portion 
of atoms of sulfenyl amide protein tyrosine phosphatase IB; (b) structure factor 
data for sulfenyl amide protein tyrosine phosphatase IB, said structure factor data 

10 being derivable from the atomic coordinate data of Table 1 or Table 2; (c) atomic 
coordinate data of a target sulfenyl amide protein tyrosine phosphatase IB 
homologue or analogue generated by homology modeling of the target based on the 
data of Table 1 or Table 2; (d) atomic coordinate data of a protein generated by 
interpreting X-ray crystallographic data or NMR data by reference to the data of 

15 Table 1 or Table 2; and (e) structure factor data derivable from the atomic 

coordinate data of (c) or (d); and (ii) receiving said computer-readable data from 
said remote device. 

Thus the remote device may comprise e.g. a computer system or computer readable 
20 media of one of the previous aspects of the invention. The device may be in a 

different country or jurisdiction from where the computer-readable data is received. 
The communication may be via the internet, intranet, e-mail etc. Typically the 
communication will be electronic in nature, but some or all of the communication 
pathway may be optical, for example, over optical fibre transmission lines. 

25 Drug Discovery 

Determination of the 3D structure of PTP1B provides important information about 
the nature of the changes to the active site of PTPs upon oxidation, in particular the 
changes in PTP1B upon oxidation to the PTP1B sulfenyl amide. In particular, the 
X-ray data provide information about new binding sites created by distortion of the 
30 active site as a consequence of the formation of the sulfenyl amide. Information 
about the new binding sites can then be used for rational design of compounds that 
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bind to PTPsulfenyl amide, especially PTP1B sulfenyl amide. This can be achieved 
by e.g. computational techniques which identify possible binding ligands for the 
active sites, by enabling linked-fragment approaches to drug design, and by 
enabling the identification and location of bound ligands using X-ray 
5 crystallographic analysis. These techniques are discussed in more detail below. 



In silico Analysis 

The provision of the crystal structure of PTP1B sulfenyl amide allows a novel 
approach for drug discovery for modulators of this inactive form of PTP and in 
particular of PTP1B. Accordingly, the invention provides a computer-based 
1 0 method of rational drug design which comprises: 

providing the structure of the PTPlb sulfenyl amide as defined by the 
coordinates of Table 1 or Table 2; 

providing the structure of a candidate modulator molecule; and 
fitting the structure of candidate to the structure of the sulfenyl amide of 
15 Table 1 or Table 2. 

More particularly, the crystal structure of the sulfenyl amide can be used to design 
drug molecules that bind to the sulfenyl amide of PTP1B to inhibit or prevent its 
conversion to the active form of PTP1B and hence another aspect of the invention 
comprises a computer-based method of rational drug design which comprises; 
20 providing the structure of the PTP IB sulfenyl amide as defined by the 

coordinates of Table 1 or Table 2; 

providing the structure of a candidate compound; and 
fitting the structure of the candidate compound to the structure of the 
sulfenyl amide as defined by the coordinates of Table 1 or Table 2. 

25 The invention further provides a method of identifying by rational drug design a 
compound capable of reducing the level of activity of a protein tyrosine 
phosphatase (PTP) in a cellular environment, the PTP being one which is 
convertible in a cellular environment between an active form and an inactive or less 
active form, the inactive form or less active form being characterised by the 

30 presence of a sulfenyl amide moiety formed at the active site of the PTP between 
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the sulphur atom of a cysteine group and a backbone nitrogen atom of a 
neighbouring amino acid; 

which method comprises: 

(a) designing a ligand that will (i) bind to the active site in the region of the 

5 sulfenyl amide moiety to inhibit conversion of the inactive form or less active form 
back to the active form, or (ii) modify the sulfenyl amide moiety to inhibit 
conversion of the inactive form or less active form of the PTP to the active form; 

(b) synthesizing the ligand; and 

(c) determining whether the ligand reduces the level of activity of a protein 
10 tyrosine phosphate (PTP) in a cellular environment. 

In an alternative aspect, the method of the invention may utilise the coordinates of 
atoms of interest of the PTP1B which are in the vicinity of a putative binding region 
in order to model the pocket in which the a ligand will bind. These coordinates 
may be used to define a space which is then screened "/« silico " against a candidate 
1 5 modulator molecule. 

Thus the invention provides a computer-based method of rational drug design 
which comprises: 

providing the coordinates of at least two atoms of Table 1 or Table 2 of the 
PTP1B sulfenyl amide ("selected coordinates"); 
20 providing the structure of a candidate modulator molecule; and 

fitting the structure of candidate to the selected coordinates of the PTP IB 
sulfenyl amide. 

In practice, it will be desirable to model a sufficient number of atoms of the PTP IB 
sulfenyl amide as defined by the coordinates of Table 1 or Table 2 which represent 
25 a binding pocket. Binding pockets and other features of the interaction of PTP IB 
sulfenyl amide with a putative compound of the invention are described below. 
Thus, in this embodiment of the invention, there will preferably be provided the 
coordinates of at least 5, preferably at least 10, more preferably at least 50 and even 
more preferably at least 100 atoms of the PTP sulfenyl amide structure. 
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By "fitting", it is meant determining by automatic, or semi-automatic means, 
interactions between at least one atom of the candidate and at least one atom of the 
PTP1B sulfenyl amide , and calculating the extent to which such an interaction is 
stable. Interactions include attraction and repulsion, brought about by charge, steric 
considerations and the like. Various computer-based methods for fitting are 
described fiuther herein. 

By "binding site" we mean a site (such as an atom, a functional group of an amino 
acid residue or a plurality of such atoms and/or groups) in a PTP1B sulfenyl amide 
binding cavity which may bind to a candidate ligand. Depending on the particular 
molecule in the cavity, sites may exhibit attractive or repulsive binding interactions, 
brought about by charge, steric considerations and the like. 

As a result of the determination of the PTP1B sulfenyl amide 3D structure, more 
purely computational techniques for rational drug design may also be used to design 
PTP sulfenyl amide ligands (for an overview of these techniques see e.g Walters et 
al (Drug Discovery Today, Vol.3, No.4, (1998), 160-178; Abagyan, R.; Totrov, M. 
Curr. Opin. Chem. Biol. 2001, 5, 375-382)). For example, automated ligand- 
receptor docking programs (discussed e.g. by Jones et al. in Current Opinion in 
Biotechnology, Vol.6, (1995), 652-656 and Halperin, I.; Ma, B.; Wolfson, H.; 
Nussinov, R. Proteins 2002, 47, 409-443) which require accurate information on 
the atomic coordinates of target receptors may be used to design potential PTP1B 
sulfenyl amide ligands. 

The step of providing the structure of a candidate ligand molecule may involve 
selecting the compound by computationally screening a database of compounds for 
interaction with the active site. For example, a 3-D descriptor for the candidate 
modulator may be derived, the descriptor including geometric and functional 
constraints derived from the architecture and chemical nature of the binding site. 
The descriptor may then be used to interrogate the compound database, a candidate 
ligand being a compound that has a good match to the features of the descriptor. In 
effect, the descriptor is a type of virtual pharmacophore. 
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In any event, the determination of the three-dimensional structure of PTP1B 
sulfenyl amide provides a basis for the design of new and specific ligands for 
PTP1B sulfenyl amide. For example, knowing the three-dimensional structure of 
PTP1B sulfenyl amides, computer modelling programs may be used to design 
5 different molecules expected to interact with possible or confirmed active sites, 
such as binding sites or other structural or functional features of PTP1B sulfenyl 
amide. 

More specifically, a candidate ligand for PIT IB sulfenyl amide can be examined 
through the use of computer modelling using a docking program such as GOLD 

10 (Jones et al., J. Mol Biol, 245, 43-53 (1995), Jones et aL, J. Mol Biol, 267, 727- 
748 (1997)), GRAMM (Vakser, LA., Proteins , SuppL, 1:226-230 (1997)), DOCK 
(Kuntz et al, JMolBiol 1982 , 161, 269-288, Makino et al, J.Comput.Chem. 1997, 
18, 1812-1825), AUTODOCK (Goodsell et al, Proteins 1990, 8, 195-202, Morris et 
al, J.Comput.Chem. 1998, 19, 1639-1662.), FlexX, (Rarey et al, JMolBiol 1996, 

15 261, 470-489) or ICM (Abagyan et al, J.Comput.Chem. 1994, 15, 488-506). . This 
procedure can include computer fitting of candidate ligands to PTP1B sulfenyl 
amide to ascertain how well the shape and the chemical structure of the candidate 
ligand will bind to the enzyme. 

Also computer-assisted, manual examination of the active site structure of PTP1B 
20 may be performed. The use of programs such as GRID (Goodford, J. Med. Chem., 
28, (1985), 849-857) - a program that determines probable interaction sites between 
molecules with various functional groups and the enzyme surface - may also be 
used to analyse the active site to predict partial structures of ligands. 

Computer programs can be employed to estimate the attraction, repulsion, and 
25 steric hindrance of the two binding partners (e.g. the PTP1B sulfenyl amide and a 
candidate ligand). Generally the tighter the fit, the fewer the steric hindrances, and 
the greater the attractive forces, the more potent the candidate ligand since these 
properties are consistent with a tighter binding constant. Furthermore, the more 
specificity in the design of a candidate ligand, the more likely it is that it will not 
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interact with other proteins as well. This will tend to minimise potential side- 
effects due to unwanted interactions with other proteins. 

Linked fragment and fragment grow in g approaches. 

If more than one PTP1B sulfenyl amide binding site is characterised and a plurality 
5 of respective compounds are designed or selected, the candidate ligand may be 
formed by linking the respective compounds into a larger compound which 
maintains the relative positions and orientations of the respective compounds at the 
active sites. The larger compound may be formed as a real molecule or by 
computer modelling. 

1 0 Linked-fragment approaches to drug design also require accurate information on the 
atomic coordinates of target receptors. Small compounds which have the potential 
to bind to regions of PTP1B sulfenyl amide which in themselves may not be 
modulator compounds may be assembled by chemical linkage to provide candidate 
modulators. Thus the basic idea behind these approaches is to determine the 

1 5 binding locations of plural ligands to a target molecule, and then construct a 

molecular scaffold to connect the ligands together in such a way that their relative 
binding positions are preserved. The ligands may be provided computationally and 
modelled in a computer system, or provided in an experimental setting, wherein 
crystals according to the invention are provided and a plurality of ligands soaked 

20 separately or in mixed pools into the crystal prior to X-ray analysis and 
determination of their location. 

For example, the binding of one or more molecular fragments can be determined in 
the protein binding cavity by X-ray crystallography. Molecular fragments are 
typically compounds with a molecular weight between 100 and 200 Da. This can 

25 then provide a starting point for medicinal chemistry to optimize the interactions 
using a structure-based approach. The fragments can be combined onto a template 
or used as the starting point for "growing out" a modulator into other cavities of the 
protein. The fragments can be positioned in the binding cavity or cavities of 
PTP1B sulfenyl amide and then 'grown' to fill the space available, exploring the 

30 electrostatic, van der Waals or hydrogen-bonding interactions that are involved in 
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molecular recognition. The potency of the original weakly binding fragment thus 
can be rapidly improved using iterative structure-based chemical synthesis. 

At one or more stages in the fragment growing approach, the compound may be 
synthesized and tested in a biological system for its activity. This can be used to 
5 guide the further growing out of the fragment. 

Where two fragment-binding regions are identified, a linked fragment approach 
may be based upon attempting to link the two fragments directly, or growing one or 
both fragments in the manner described above in order to obtain a larger, linked 
structure which may have the desired properties. 

10 Thus the binding site of two of more ligands are determined and may be connected 
to thus form a potential lead compound that can be further refined using e.g. the 
iterative technique of Greer et al. For a virtual linked-fragment approach see 
Verlinde et al., J. of Computer-Aided Molecular Design, 6, (1992), 13 1-147, and for 
NMR and X-ray approaches see Shuker et al., Science, 274, (1996), 1531-1534 and 

15 Stout et al., Structure, 6, (1998), 839-848. The use of these approaches to design 
PIT IB sulfenyl amide modulators is made possible by the determination of the 
PIT IB sulfenyl amide structure. 

Generation and Analysis of ligand-PTPlB sulfenvl amide complexes 

In a further aspect, the invention provides a method for determining the structure of 
20 a compound bound to sulfenyl amide PTP1B. The methods above may comprise the 
further steps of: 

obtaining or synthesising a candidate modulator; 

forming a complex of PTP1B sulfenyl amide and said candidate modulator; 

and 

25 analysing said complex by X-ray crystallography to determine the ability of 

said candidate modulator to interact with PTP1B sulfenyl amide. 

The invention also provides a method for determining the structure of a compound 
bound to sulfenyl amide PTPlb, said method comprising: (a) providing a crystal of 
sulfenyl amide PTPlb according to the invention; (b) soaking the crystal with said 
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compound; and (c) determining the structure of said sulfenyl amide PTPlb 
compound complex by employing the data of Table 1 or Table 2. 

Alternatively, the sulfenyl amide PTP1B and compound may be co-crystallized. 
5 Thus the invention provides a method for determining the structure of a compound 
bound to sulfenyl amide PTPlb, said method comprising; mixing the protein with 
the compound(s), crystallizing the protein-compound(s) complex; and determining 
the structure of said sulfenyl amide PTPlb -compound(s) complex by reference to 
the data of Table 1 or Table 2. 

10 

A mixture of compounds may be soaked or co-crystallized with the crystal, wherein 
only one or some of the compounds may be expected to bind to the sulfenyl amide 
PTPlb. As well as the structure of the complex, the identity of the complexing 
compound(s) is/are then determined. 

15 In either case, substrate or a substrate analogue thereof may optionally be present. 

The method may comprise the further steps of: (a) obtaining or synthesising said 
compound; (b) forming a complex of sulfenyl amide PTP1B and said compound; 
and (c) analysing said complex by X-ray crystallography or NMR spectroscopy to 
determine the ability of said compound to interact with sulfenyl amide PTPlb. 

20 This information may thus be used to design and synthesize novel classes of 
sulfenyl amide PTP1B inhibitors. 

Detailed structural information can then be obtained about the binding of the 
candidate modulator to PTP1B sulfenyl amide, and in the light of this information 
adjustments can be made to the structure or functionality of the candidate 
25 modulator, e.g. to improve binding to the active site. The above steps may be 
repeated and re-repeated as necessary. 

In another aspect, the invention provides a method of analysing a complex of 
PTP1B sulfenyl amide and a candidate modulator comprising the step of employing 
(i) X-ray crystallographic diffraction data from the complex and (ii) a three- 
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dimensional structure of PTP1B sulfenyl amide, or at least one sub-domain thereof, 
to generate a difference Fourier electron density map of the complex, the three- 
dimensional structure being defined by atomic coordinate data according to Table 1 
or Table 2. 

5 Therefore, such complexes can be crystallised and analysed using X-ray diffraction 
methods, e.g. according to the approach described by Greer et aL, J. of Medicinal 
Chemistry, Vol. 37, (1994), 1035-1054, and difference Fourier electron density 
maps can be calculated based on X-ray diffraction patterns of soaked or co- 
crystallised PTP1B sulfenyl amide and the solved structure of uncomplexed PTP1B 

10 sulfenyl amide. These maps can then be used to determine whether and where a 
particular candidate modulator binds to PTP1B sulfenyl amide and/or changes the 
conformation of PTP1B sulfenyl amide. 

Electron density maps can be calculated using programs such as those from the 
CCP4 computing package (Collaborative Computational Project 4. The CCP4 
15 Suite: Programs for Protein Crystallography, Acta Crystallographica, D50, (1994), 
760-763.). For map visualisation and model building programs such as "O" (Jones 
et aL, Acta Crystallograhy, A47, (1991), 110-1 19) or QUANTA" (1994, San Diego, 
CA: Molecular Simulations, Jones et aL, Acta Crystallography A47 (1991), 1 10- 
119) can be used. 

20 Greer et al. mentioned above describes an iterative approach to ligand design based 
on repeated sequences of computer modelling, protein-ligand complex formation 
and X-ray analysis. Thus novel thymidylate synthase inhibitor series were designed 
de novo by Greer et al., and PTP1B sulfenyl amide inhibitors may also be designed 
in the this way. More specifically, using e.g. GRID on the solved 3D structure of 

25 PTP1B, a candidate modulator for PTP1B sulfenyl amide may be designed that 
complements the functionalities of the PTP1B sulfenyl amide binding site(s). The 
candidate modulator compound can then be synthesised, formed into a complex 
with PTP IB sulfenyl amide, and the complex then analysed by X-ray 
crystallography to identify the actual position of the bound compound. 
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Determination of the position of the candidate modulator in the complex allows 
determination of the interactions of it with PTP IB sulfenyl amide. This will allow 
those of skill in the art to analyse the affinity and specificity of the compound for 
PTP IB sulfenyl amide, and to propose modifications to the compound to increase 
5 or decrease either or both of these properties. Thus the structure and/or functional 
groups of the compound can then be adjusted, if necessary, in view of the results of 
the X-ray analysis, and the synthesis and analysis sequence repeated until an 
optimised compound is obtained. Related approaches to structure-based drug 
design are also discussed in Bohacek et al., Medicinal Research Reviews, Vol. 16, 
10 (1996), 3-50. 

Many of the techniques and approaches to structure-based drug design described 
above require X-ray analysis to identify the binding position of a candidate 
modulator in a complex with a protein. A common way of doing this is to perform 
X-ray crystallography on the complex, produce a difference Fourier electron 
1 5 density map, and associate a particular pattern of electron density with the 

candidate modulator. However, in order to produce the map (as explained e.g. by 
Blundell et al. mentioned above) it is necessary to know beforehand the protein 3D 
structure (or at least the protein structure factors). 

Therefore, determination of the PTP IB sulfenyl amide structure also allows 
20 difference Fourier electron density maps of complexes of PTP IB sulfenyl amide 
with a candidate modulator to be produced, which can greatly assist the process of 
rational drug design. 

The approaches to structure-based drug design described above all require initial 
identification of possible compounds for interaction with target bio-molecule (in 

25 this case PTP IB sulfenyl amide). Sometimes these compounds are known e.g. 
from the research literature. However, when they are not, or when novel 
compounds are wanted, a first stage of the drug design program may involve 
computer-based in silico screening of compound databases (such as the Cambridge 
Structural Database) with the aim of identifying compounds which interact with the 

30 binding site or sites of the target bio-molecule (see Martin, J. Med. Chem. 9 vol 35, 
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2145-2154 (1992)). Screening selection criteria may be based on pharmacokinetic 
properties such as metabolic stability and toxicity. However, determination of the 
PTP1B sulfenyl amide structure allows the architecture and chemical nature of each 
PTP1B active site to be identified, which in turn allows the geometric and 
5 functional constraints of a descriptor for the potential inhibitor to be derived. The 
descriptor is, therefore, a type of virtual 3-D pharmacophore, which can also be 
used as selection criteria or filter for database screening. 

The Crystal Structure of the Catalytic Domain of PTP1B Sulfenvl Amide 

The structure of the portion of the PTP sulfenyl amide corresponding to the 
10 catalytic domain of PTP1B is defined by the atomic coordinates set out in Table 1 
or Table 2. The three dimensional structure of the binding sites of the PTP1B 
sulfenyl amide are shown schematically in Figures lb and lc. 

The crystal structure shows electron density close to the side chain of the catalytic 
cysteine characteristic of the presence of a covalent bond between the sulphur Sy 

15 atom of Cys215 and the backbone nitrogen atom of Ser216 (see Figure lc). The 
sulfenyl-amide bond has a bond length of 1 .7 A and results in a five-membered 
puckered ring that has not been previously observed in proteins. In conjunction 
with the formation of the sulfenyl-amide derivative the phosphate-binding cradle 
adopts a novel conformation, distinct from the structure of the known inactive 

20 C215S PTP IB mutant 16 . The cradle has shifted into the phosphotyrosine binding 
site and stabilises the sulfenyl-amide by a hydrophobic interaction with the side 
chain of Ile219 (Figure lb). In addition, the side chain of Gln262 moves out of the 
active site and also the pTyr loop adopts a unique conformation (Figure lb). The 
more exposed conformation of the pTyr loop results from the loss of the hydrogen 

25 bond between the hydroxyl groups of Tyr46 and Ser216, which anchors the pTyr 
loop in native PTP IB and is stabilised by a network of water molecules mediating 
interactions between Asp48 and the rest of the protein. 

Formation of the sulfenyl-amide arises from oxidation of the active site Cys215, 
most likely via oxidation of Cys215 to sulfenic acid, followed by a nucleophilic 
30 attack of the backbone nitrogen atom of Ser2 1 6 on the Sy atom of Cys2 1 5 . Indeed, 
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it has been postulated that the hydrogen bond interaction between the carbonyl 
oxygen atom of Cys215 and the Nl atom of the invariant His214 side chain in 
native PTP1B increases the partial charge on the backbone nitrogen atom of 
Ser216 12 , enhancing its reactivity and supporting a nucleophilic substitution 
5 mechanism. In vivo the sulfenic acid can be formed by oxidation with hydrogen 
peroxide (H2O2) 17 , but under our experimental conditions it is most likely triggered 
by redox-cycling of 2-Phenyl-isoxazolidine-3,5-dione (figure 2). The leaving group 
in the cyclisation reaction could be a water molecule in an SN2 substitution reaction 
(direct mechanism, Figure 2). Alternatively, the sulfenic acid derivative might be 
10 oxidised under the experimental soaking conditions, or by physiological oxidants 
such as hydrogen peroxide or oxidised glutathione in vivo, to form a highly reactive 
intermediate. This would result in faster formation of the sulfenyl-amide species 
(oxidative mechanism, Figure 2). 

The role for the sulfenyl-amide intermediate in oxidative regulation of PTP1B in 
15 cells is likely to be the prevention of irreversible oxidation of Cys215 and thus 
facilitation of its thiol-mediated reactivation (Figure 2). To demonstrate the 
reversibility of the S-N bond, reducing conditions were employed in an attempt to 
reduce the sulfenyl-amide derivative in the crystals. First, two PTP1B crystals were 
soaked in a solution containing 2-Phenyl-isoxazolidine-3,5-dione. X-ray data 
20 collected from one of the crystals confirmed the formation of the sulfenyl-amide 
bond and the concomitant conformational changes in the active site. The second 
crystal was back-soaked in 20mM reduced glutathione in an attempt to reduce the 
Cys215 sulfenyl-amide derivative back to its native form. Indeed, X-ray data 
collected from the back-soaked crystal showed the entire active site back in its 
25 native conformation, thus structurally confirming reactivation of the sulfenyl-amide 
PTP1B derivative by a biologically relevant reducing agent and strengthening the 
hypothesis of a protective role in PTP1B redox-regulation. 

The studies carried out provide a detailed structural understanding of the 
intermediates involved in redox-regulation of PTP1B and reveal a novel oxidation 
30 state of its catalytic cysteine. The formation of the sulfenyl-amide intermediate is 
an elegant mechanism to protect Cys215 from further oxidation, and the 
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concomitant conformational changes of the phosphate-binding cradle and pTyr loop 
may serve to signal the inactive state of the enzyme. The structures of the sulfenic 
acid and sulfenyl-amide derivative indicate that reactivation of PTP1B appears to 
be facilitated by the sulfenyl-amide form. 

5 The sulfenyl-amide form of PTPs is an important regulatory intermediate of these 
proteins. It is stable to oxidation to the sulfinic or sulfonic protein forms that are 
irreversibly inhibited and therefore prevents permanent inactivation of the protein. 
It can then subsequently be converted to the reduced, active form of the protein by 
physiological reducing agents such as thiols. The sulfenyl amide is an 
1 0 isothiazolidin-3-one ring system, which has not been previously observed in 

proteins. It is an electrophilic species in the active site of an enzyme, and this is also 
very unusual as most enzymes display nucleophiles in their catalytic machinery. 

Therapeutic and Medical Uses of the Inhibitors of PTP Sulfenyl Amide 

It is envisaged that compounds that stabilize the sulfenyl amide form or effect 
15 reversible or irreversible covalent modification of the sulfenyl amide form will be 
useful as therapeutic agents in the treatment of disease states or conditions mediated 
by protein tyrosine phosphatases. 

Accordingly, in one aspect, the invention provides the use of a compound for the 
manufacture of a medicament for the treatment of a disease or condition mediated 
20 by protein tyrosine phosphatase, wherein the compound is one that binds to protein 
tyrosine phosphatase sulfenyl amide to prevent or inhibit conversion of the protein 
tyrosine phosphatase sulfenyl amide to an active reduced form of the protein 
tyrosine phosphatase. 

In a further aspect, the invention provides a method of reducing the activity of a 
25 protein tyrosine phosphotase (PTP), the PTP being one which is convertible 
between an active form and an inactive or less active form, the inactive or less 
active form being characterised by the presence of a sulfenyl amide moiety formed 
at the active site of the PTP between the sulphur atom of a cysteine group and a 
backbone nitrogen atom of a neighbouring amino acid, whereby the sulfenyl amide 
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moiety distorts and inactivates the active site of the PTP and wherein the sulfenyl 
amide moiety is disruptible to restore the inactive or less active form of the PTP to 
the active form thereof; 

which method comprises inhibiting disruption of the sulfenyl amide moiety, 
5 or modifying the sulfenyl amide moiety to prevent restoration of the inactive or less 
active form of the PTP to the active form. 

In another aspect, the invention provides a method of inhibiting or preventing the 
reduction of sulfenyl amide PTP1B to PTP1B in a cellular environment by exposing 
the PTP IB to a ligand capable of binding to the sulfenyl amide PTP IB in the region 
10 of the sulfenyl amide moiety so as to prevent reduction of the sulfenyl amide 
moiety by an intracellular reducing agent. 

The invention also provides a method of inhibiting or preventing the reduction of 
sulfenyl amide PTP1B to PTP1B in a cellular environment by exposing the PTP1B 
to a ligand capable of binding to the sulfenyl amide PTP IB in the region of the 
15 sulfenyl amide moiety, the ligand having a nucleophilic moiety capable of 
modifying the sulfenyl amide moiety so as to prevent its reduction by an 
intracellular reducing agent. 

The Compounds of the Invention 

The invention provides novel compounds per se that inhibit protein tyrosine 
20 phosphatases by interacting with the sulfenyl amide PTP to prevent or inhibit 
conversion of the PTP sulfenyl amide to an active reduced form of the protein 
tyrosine phosphatase. 

In addition to compounds per se, the invention provides compounds of the aforesaid 
type for use in therapy or for use in medicine, for example for use in the treatment 
25 of diseases or conditions mediated by protein tyrosine phosphatase. 

Conversion of the PTP sulfenyl amide to the corresponding active reduced form can 
be inhibited in several ways by small molecule ligands. 
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Mode 1 . Non-covalent binding inhibitors that stabilise the sulfenyl-amide protein 
form. These inhibitors are designed to prevent physiological cell cycling of the 
protein form into its active reduced form by preventing binding to the protein. 

Mode 2. Reversible covalent binding inhibitors that modify the sulfenyl-amide form 
of the protein. These inhibitors are designed to react with the active site sulfenyl 
amide, and in so doing, prevent its reactivation by physiological cell cycling. 

Mode 3. Irreversible covalent binding inhibitors that modify the sulfenyl-amide 
form of the protein. These inhibitors are designed to react with the active site 
sulfenyl amide, and in so doing, prevent its reactivation by physiological cell 
cycling. 

The concerted distortion of the phosphate binding cradle and phosphotyrosine 
recognition loop upon sulfenyl-amide formation destroys the normal 
phosphotyrosine binding site and creates a new groove (referred to herein as "the 
first binding site") in which small molecules could bind. This groove is lined by 
residues 41-47 of the phosphotyrosine recognition loop, residues 88-90, 1 15 to 120, 
residues 179 to 1 84 of the WPD-loop, residues 215 to 219 of the phosphate-binding 
cradle, and residues 262-266. 

Compounds having Mode 1 activity include compounds that can make polar 
interactions at the first binding site with one or more of: 



(1) 


Lys41 


(2) 


Asn42 


(3) 


Arg45 


(4) 


Tyr46 


(5) 


Arg47 


(6) 


Asn90 


(7) 


Glnll5 


(8) 


Lysll6 


(9) 


Serll8 


(10) 


Lysl20 


(11) 


Trpl79 
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15 



(12) Ser216 

(13) Arg221 

(14) Gln262 

(15) Thr263 

(16) Asp265,and 

(17) GIn266 

The amino acid numbering convention used above refers to the numbering of 
PTP1B. 

Preferably, the compounds make polar interactions with two or more of the listed 
moieties (1) to (17), more preferably three or more, for example four or more, and 
more particularly five or more. 

The compounds can make hydrophobic interactions with one or more of: 



(18) 


Leu88 


(19) 


Pro89 


(20) 


Leull9 


(21) 


Phel82 


(22) 


Glyl83 


(23) 


Vail 84 


(24) 


Ala217 


(25) 


Ile219 


(26) 


the apolar part of Arg221, and 


(27) 


the apolar part of Gln262. 



Preferably, the compounds make hydrophobic interactions with two or more of the 
listed moieties, more preferably three or more, for example five or more. 

Additional hydrogen bonds and hydrophobic interactions may be formed between a 
bound molecule and the protein backbone. 

A second shallow depression in which a small molecule could bind (referred to 
hereinafter as "the second binding site") is located on the other side of the distorted 
protein tyrosine recognition loop and includes residues from the second phosphate 
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binding site in PTP1B. This potential binding area is roughly defined by residues of 
the WPD-loop, the pTyr recognition loop and the loop containing residues 28-32. 

Compounds having Mode 1 activity include compounds that can make polar 
interactions at the second binding site with one or more of: 



5 


(44) 


Arg24 




(14) 


Gln262 




(45) 


Arg254 




(46) 


Asn 44 




(5) 


Arg47 


10 


(4) 


Tyr46 




(1) 


Lys41 




(47) 


Lys36 




(48) 


Asp29 




(49) 


Cys32 


15 


(50) 


Ser50 



The compounds can make hydrophobic interactions with one or more of: 



(51) 


Leu250 


(14) 


Gln262 


(41) 


Met258 


20 (35) 


Val49 


(4) 


Tyr46 


(39) 


Gly218 


(52) 


Gly259 


(53) 


Phe52 


25 (42) 


Leu260 


(54) 


Leu261 


(55) 


Ala35 and 


(56) 


the backbone of Asp48. 



In the middle of this binding area a third potential binding site (hereinafter referred 
30 to as "the third binding site") has been created as a result of the distortion of the 
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phosphate-binding cradle. This water filled cavity is located directly under the 
distorted phosphate-binding cradle and has a narrow entrance between residues 
Val49, Gly218 and Gin 262. The cavity walls are formed by Asp48, Val49, Leu83, 
Gly218, Gly220, Ser222, Arg257, Gly259, Gln262 and the sulfenyl-amide. 

5 Accordingly, compounds having Mode 1 activity include compounds that can make 
polar interactions at the third binding site with one or more of: 
(3) Arg45 

(29) Asp48 

(30) Ser222 
10 (31) Arg257 

(14) Gln262 

(33) the protein backbone of one or more of (i) Thr84, (ii) Gly218, (iii) Gly220, 
(iv) Gly223, (v) Met258, (vi) and Gly259; 

(34) and the sulfenyl-amide residue. 

1 5 Preferably the compounds can make polar interactions at two or more (more 

preferably three or more, four or more, or five or more) of the residues (3), (29) to 

(31) , (14), (33) and (34). 

Hydrophobic interactions with the compound can be made at the third binding site 



20 by: 




(35) 


Val49 


(36) 


Leu83 


(37) 


Gln85 


(38) 


Gly86 


25 (39) 


Gly218 


(40) 


Gly220 


(41) 


Met258 


(42) 


Leu260 and 


(43) 


the main chain of His214. 
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Preferred interactions between the compounds of the invention and the three 
binding sites described above are as follows: 

First binding site: 

Polar: Trp79, Arg221, Lys46, Glu266, Arg45, Serll8 
5 Hydrophobic: Ile219, Leu88, Ilel20 

Second binding site: 

Polar: Arg254, Lys36, Asp29, Gln262 

Hydrophobic: Met258, Val49 

10 

Third binding site (cavity): 
Polar: Arg257, Asp48, Ser222 
Hydrophobic: sulfenyl amide, Val 49, Leu83 

15 As described above, compounds useful in the invention include those that bind to 
the sulfenyl amide PTP in the region of the sulfenyl amide moiety thereby to 
prevent reduction or other reaction of the sulfenyl amide with an endogenous 
intracellular molecule such as glutathione and conversion back to the active form of 
the PTP. However, in an alternative embodiment of the invention, the compounds 

20 can be ligands that possess a nucleophilic functional group that can react either 
reversibly (Mode 2) or irreversibly (Mode 3) with the electrophilic sulfenyl amide 
active. Scheme 1 below illustrates how ligands inhibit the action of the protein in 
the cell by preventing it from being converted back to an active form. 
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SH 


Oxidation 


(active) 






Reduction 




f RSH 




^ RS 


— SR 


E— « 


5— SR - 


RSH 



E-S-OH 



Cyclisation 



T ftll 




Sulfenyl Amide 
Enzyme Intermediate 
(inactive) 

Nuc-L 



Covalentiy modified protein 
(inactive) 



SCHEME 1 



The compounds of the invention can thus take the form of nucleophilic ligands, 
having a nucleophilic group that will react with the sulfenyl amide moiety, and a 
binding region for binding to the sulfenyl amide PTP in the region of the sulfenyl 
amide moiety. The binding region can be one that exhibits one or more of the polar 
and non-polar interactions 1 to 56 set out above in relation to Mode I compounds. 

The nucleophilic group will typically contain a heteroatom (e.g. selected from 
nitrogen, sulphur, oxygen and phosphorus) that is either neutral or negatively 
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charged, and which may be located adjacent a carbon atom or another heteroatom, 
which is capable of reacting with the sulfenyl amide species. Nitrogen, oxygen and 
sulfur nucleophiles are preferred. 

Thus, the sulfenyl amide moiety can be modified by reaction with a nucleophilic 
5 ligand to prevent it from reverting to the active form of the enzyme. 

The nucleophilic group is selected from the group consisting of a thiol, disulfane, 
primary thioamide, secondary thioamide, primary thiourea, secondary thiourea, 
primary amine, secondary amine, primary hydrazine, secondary hydrazine, primary 
hydrazide, secondary hydrazide, primary hydrazone, secondary hydrazone, primary 
10 amide, secondary amide, primary urea, secondary urea, primary sulfonamide, 
secondary sulfonamide, 5-membered ring heterocycle containing NH, alcohol, 
hydroxylamine, oxime, hydroxamic acid, carboxylic acid (preferably other than an 
oxalamic acid), sulfoxide, sulfate and a nitrone. 

Particular examples of nucleophiles are set out in Table 3 below. 

1 5 Table of Nucleophiles 

j 

NuoL 

Nuc = nucleophile 
L = Ligand 



Table 3 



Type of nucleophile 


Structure 


Name 


Sulphur 


L— SH 


Thiol 
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L — S— SH 


Disulfane 




A 

MM 

2 


Primary Thioamide 




IM 

H 


Secondary Thioamide 




L fj NH 2 


Primary thiourea 




S 


Secondary thiourea 


Nitrogen 


L— NH 2 


Primary amine 




L— N— L' 
H 


Secondary amine 




L-N-NH 2 


Primary Hydrazine 




H H 


Secondary Hydrazine 




L N-NH 2 
ii 


Primary Hydrazide 




A H 

H L 1 


Secondary Hydrazide 



WO 2004/087905 



57 



PCT/GB2004/001446 





L=N-NH 2 


Primary Hydrazone 




L — N — N — L 
H 


Secondary Hydrazone 




Kill 

rvlrf 2 


Primary amide 




N L 
H 


Secondary amide 




x 

L— N NH 2 


Primary urea 




O 

L _ N JL. N _ L , 

H H 


Secondary urea 




L-S-NH 2 
O 


Primary Sulfonamide 




O 

II 

L— S— N— L' 
II H 
O 


Secondary Sulfonamide 






5-membered ring heterocycle 
containing NH 


Oxygen 


L-OH 


Alcohol 
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L— N-OH 
H 


Hydroxylamine 




L=N— OH 


Oxime 




L ~"^WOH 
N wn 

H 


Hydroxamic acid 




o 

OH 


^aiuuAyiic dt/ici ypreieraoiy 
not oxalamic acids) 




+ 

L — S— O 


Sulfoxide 




O 

II 

L-S-O 

o 


Sulfate 




+ - 

L=N-0 


Nitrone 



In the table, the symbols L and L' represents the residue of the ligand, other than 
the nucleophilic group. The residue may of course contain one or more further 
nucleophilic groups of the type shown. 

5 Compounds of the invention that are nucleophilic ligands will form new covalently 
bound protein-ligand species. In some cases [Mode 2\ the protein-ligand species is 
capable of undergoing the reverse reaction to reform the sulfenyl amide. In other 
cases {Mode 3), the nucleophiles will form a covalently bound protein-ligand 
complex in which the reverse reaction does not occur in the environment of the 
10 active site, or occurs very slowly, so that the complex is formed irreversibly. 
Additionally, certain covalent protein-ligand complexes, formed by reaction of 
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nucleophilic ligands with the sulfenyl amide protein, may undergo additional 
reactions that prevent the reverse reaction from occurring, resulting in irreversible 
inhibition. 

For example, where the nucleophilic heteroatom is an oxygen atom, the resulting 
covalent protein-ligand complex will contain a sulfur oxygen bond which is 
therefore oxidised. Subsequent further oxidation under cellular conditions could 
lead to oxidation of the protein to sulfinyl or sulfonic acid oxidation states, 
irreversibly modifying the protein. 

The compounds of the invention are typically synthetic compounds that are not 
normally encountered in a cellular environment, although naturally occurring 
compounds derived from plant sources, marine sources or other non-mammalian 
sources may be used where appropriate. 

The compounds of the invention are typically organic compounds and can be non- 
peptides, peptides or modified peptides. In one embodiment, the compounds are 
not peptides. 

The compounds of the invention may comprise a scaffold formed from one or more 
optionally substituted carbocyclic or heterocyclic ring systems, the ring systems 
and/or the substituents having one or more polar or non-polar moieties for 
interacting with one or more, preferably a plurality of the binding sites 1 to 43 listed 
above. 

The carbocyclic and heterocyclic ring systems can be aromatic or non-aromatic ring 
systems. When the carbocyclic or heterocyclic groups are aryl or heteroaryl groups, 
they can have, for example, from 5 to 12 ring members, more usually from 5 to 10 
ring members. The term "aryl" as used herein refers to a carbocyclic group having 
aromatic character and the term "heteroaryl" is used herein to denote a heterocyclic 
group having aromatic character. The terms "aryl" and "heteroaryl" embrace 
polycyclic (e.g. bicyclic) ring systems wherein one or more rings are non-aromatic, 
provided that at least one ring is aromatic. The aryl or heteroaryl groups can be 
monocyclic or bicyclic groups and can be unsubstituted or substituted with one or 
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more substituents. Examples of heteroaryl groups are monocyclic and bicyclic 
groups containing from five to twelve ring members, and more usually from five to 
ten ring members. The heteroaryl group can be, for example, a five membered or 
six membered monocyclic ring or a bicyclic structure formed from fused five and 
5 six membered rings or two fused six membered rings. Each ring may contain up to 
about four heteroatoms typically selected from nitrogen, sulphur and oxygen. 
Typically the heteroaryl ring will contain up to 3 heteroatoms, more usually up to 2, 
for example a single heteroatom. In one embodiment, the heteroaryl ring contains 
at least one ring nitrogen atom. The nitrogen atoms in the heteroaryl rings can be 
1 0 basic, as in the case of imidazole or pyridine, or essentially non-basic as in the case 
of an indole or pyrrole nitrogen. In general the number of basic nitrogen atoms 
present in the heteroaryl group, including any amino group substituents of the ring, 
will be less than five. 

15 Examples of heteroaryl groups include but are not limited to pyridyl, pyrrolyl, 
furanyl, thiophenyl, imidazolyl, oxazolyl, oxadiazolyl, oxatriazolyl, isoxazolyl, 
thiazolyl, isothiazolyl, pyrazolyl, pyrazinyl, pyrimidinyl, triazinyl, triazolyl, 
tetrazolyl, quinolinyl, isoquinolinyl, benzfuranyl, benzthiophenyl, chromanyl, 
thiochromanyl, benzimidazolyl, benzoxazolyl, benzisoxazole, benzthiazolyl and 

20 benzisothiazole, isobenzofuranyl, isoindolyl, indolizinyl, indolinyl, isoindolinyl, 
purinyl (e.g., adenine, guanine), indazolyl, benzodioxolyl, chromenyl, 
isochromenyl, chroman, isochromanyl, benzodioxanyl, quinolizinyl, benzoxazinyl, 
benzodiazinyl, pyridopyridinyl, quinoxahnyl, quinazolinyl, cinnolinyl, phthalazinyl, 
naphthyridinyl and pteridinyl. 

25 Examples of carbocyclic aryl groups include phenyl, naphthyl, indenyl, and 
tetrahydronaphthyl. 

Examples of non-aromatic heterocyclic groups are groups having from 3 to 12 ring 
members, more usually 5 to 10 ring members. Such groups can be monocyclic or 
30 bicyclic, for example, and typically have from 1 to 5 heteroatom ring members 
(more usually 1, 2, 3 or 4 heteroatom ring members), usually selected from 
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nitrogen, oxygen and sulphur. The heterocylic groups can contain, for example, 
cyclic ether moieties (e.g as in tetrahydrofuran and dioxane), cyclic thioether 
moieties (e.g. as in tetrahydrothiophene), cyclic amine moieties (e.g. as in 
pyrrolidine), cyclic sulphones (e.g. as in sulfolane and sulfolene)), cyclic 
sulphoxides, cyclic sulphonamides and combinations thereof. 

Particular examples include morpholine, piperidine, pyrrolidine, pyrrolidone, 
tetrahydrofuran, tetrahydrothiophene, dioxan, tetrahydropyran, imidazoline, 
imidazolidinone, oxazoline, thiazoline, piperazine, and N-alkyl piperazines such as 
N-methyl piperazine. In general, preferred non-aromatic heterocyclic groups 
include tetrahydrofuran, morpholine, piperazine, piperidine, pyrrolidine and 
pyrrolidone. 

The carbocyclic and heterocyclic groups can each be unsubstituted or substituted by 
one or more substituent groups selected from halogen, hydroxy, trifluoromethyl, 
cyano, nitro, carboxy, amino, carbocyclic and heterocyclic groups having from 3 to 
12 ring members; a group R a -R b wherein R a is a bond, O, CO, X ] C(X 2 ), CCX^X 1 , 
X'CCXVC 1 , S, SO, SO2, NR°R d , SO2NR 0 or NR c SO a ; and R b is selected from 
hydrogen, carbocyclic and heterocyclic groups having from 3 to 7 ring members, 
and a C^g hydrocarbyl group optionally substituted by one or more substituents 
selected from hydroxy, oxo, halogen, cyano, nitro, amino, mono- or di-C M 
hydrocarbylamino, carbocyclic and heterocyclic groups having from 3 to 12 ring 
members and wherein one or more carbon atoms of the Ci. 8 hydrocarbyl group may 
optionally be replaced by O, S, SO, SCb, NR°, X'CCX 2 ), CCX^X 1 or X^pt^X 1 ; 

R° and R d are the same or different and each is hydrogen or C M 
hydrocarbyl; 

X 1 is O, S or NR C and X 2 is =0, =S or =NR°. 

Where the substituent group comprises or includes a carbocyclic or heterocyclic 
group, the said carbocyclic or heterocyclic group may be unsubstituted or may itself 
be substituted with one or more further substituent groups. In one sub-group of 
compounds of the formula (I), such further substituent groups may include 
carbocyclic or heterocyclic groups, which are typically not themselves further 
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substituted. In another sub-group of compounds of the formula (I), the said further 
substituents do not include carbocyclic or heterocyclic groups but are otherwise 
selected from the groups listed above in the definition of the substituents. 

5 Examples of halogen substituents include fluorine, chlorine, bromine and iodine. 
Fluorine and chlorine are particularly preferred. 

In the definition of the compounds above and as used hereinafter, the term 
"hydrocarbyl" is a generic term encompassing aliphatic, alicyclic and aromatic 

10 groups having an all-carbon backbone, except where otherwise stated. Examples of 
such groups include alkyl, cycloalkyl, cycloalkenyl, carbocyclic aryl, alkenyl, 
alkynyl, cycloalkylalkyl, cycloalkenylalkyl, and carbocyclic aralkyl, aralkenyl and 
aralkynyl groups. Such groups can be unsubstituted or substituted by one or more 
substituents as defined herein. The examples and preferences expressed below 

15 apply to each of the hydrocarbyl substituent groups or hydrocarbyl-containing 
substituent groups referred to in the various definitions of substituents for 
compounds of the invention unless the context indicates otherwise. 

The term "alkyl" covers both straight chain and branched chain alkyl groups. 
20 Examples of alkyl groups include methyl, ethyl, propyl, isopropyl, n-butyl, isobutyl, 
tert-butyl, n-pentyl, 2-pentyl, 3-pentyl, 2-methyl butyl, 3-methyl butyl, and n-hexyl 
and its isomers. 

Examples of cycloalkyl groups are those derived from cyclopropane, cyclobutane, 
25 cyclopentane, cyclohexane and cycloheptane. 

Examples of alkenyl groups include, but are not limited to, ethenyl (vinyl), 1- 
propenyl, 2-propenyl (allyl), isopropenyl, butenyl, buta-l,4-dienyl, pentenyl, and 
hexenyl. 

30 

Examples of cycloalkenyl groups include, but are not limited to, cyclopropenyl, 
cyclobutenyl, cyclopentenyl, cyclopentadienyl and cyclohexenyl. 
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Examples of alkynyl groups include, but are not limited to, ethynyl and 2-propynyl 
(propargyl) groups. 

5 Examples of carbocyclic aryl groups include substituted and unsubstituted phenyl. 

Examples of cycloalkylalkyl, cycloalkenylalkyl, carbocyclic aralkyl, aralkenyl and 
aralkynyl groups include phenethyl, benzyl, styryl, phenylethynyl, 
cyclohexylmethyl, cyclopentylmethyl, cyclobutylmethyl, cyclopropylmethyl and 
10 cyclopentenylmethyl groups. 

The definition "R a -R b " as used herein, includes inter alia compounds wherein R a is 
selected from a bond, O, CO, OC(O), SC(O), NR c C(0), OC(S), SC(S), NR C C(S), 
OC(NR°), SC(NR C ), NR C C(NR C ), C(0)0, C(0)S, C(0)NR c , C(S)0, C(S)S, C(S) 

15 NR°, C(NR c )0, C(NR C )S, C(NR C )NR°, OC(0)0, SC(0)0, NR c C(0)0, OC(S)0, 
SC(S)0, NR 0 C(S)O, OC(NR c )0, SC(NR c )0, NR c C(NR c )0, OC(0)S, SC(0)S, 
NR c C(0)S, OC(S)S, SC(S)S, NR C C(S)S, OC(NR°)S, SC(NR C )S, NR C C(NR C )S, 
OC(0)NR c , SC(0)NR c , NR c C(0) NR C , OC(S)NR c , SC(S) NR°, NR C C(S)NR C , 
OC(NR c )NR c , SC(NR C )NR C , NR C C(NR°NR C , S, SO, S0 2 ,NR c R d , S0 2 NR c and 

20 NR C S02 wherein R c is as hereinbefore defined. 

The moiety R b can be hydrogen or it can be a group selected from carbocyclic and 
heterocyclic groups having from 3 to 12 ring members (typically 3 to 10 and more 
usually from 5 to 10), and a Ci_ 8 hydrocarbyl group optionally substituted as 
25 hereinbefore defined. 

Examples of hydrocarbyl, carbocyclic and heterocyclic groups are as set out above. 

When present, the hydrocarbyl group can be substituted by one or more substituents 
30 selected from hydroxy, oxo, halogen, cyano, nitro, amino, mono- or di-Ci-4 

hydrocarbylamino, and monocyclic carbocyclic and heterocyclic groups having 
from 3 to 12 (typically 3 to 10 and more usually 5 to 10) ring members. Preferred 
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substituents include halogen such as fluorine. Thus, for example, the substituent 
can be a partially fluorinated or perfluorinated group such as trifluoromethyl. 

One or more carbon atoms of the C x .g hydrocarbyl group may optionally be 
replaced by O, S, SO, S0 2 , NR C , X'C(X 2 ), C(X 2 )X* or X'CpC^X 1 wherein X 1 and 
X 2 are as hereinbefore defined. For example, 1, 2, 3 or 4 carbon atoms of the 
hydrocarbyl group may be replaced by one of the atoms or groups listed, and the 
replacing atoms or groups may be the same or different. Examples of groups in 
which a carbon atom of the hydrocarbyl group has been replaced by a replacement 
atom or group as defined above include ethers and thioetihers (C replaced by O or 
S), amides, esters, thioamides and thioesters (C replaced by X*C(X 2 ) or CCX^X 1 ), 
sulphones and sulphoxides (C replaced by SO or S0 2 ) and amines (C replaced by 
NR C ). 

The combination of ring system and substituents is chosen so as to give a desired 
level of interaction with residues (1) to (56) in the three binding sites defined above. 
The extent of the interaction between the compound and the binding sites of the 
PTP sulfenyl amide can be gauged using the computer based modelling methods 
discussed above based on the atomic coordinates set out in Table 1 or Table 2. 

ha general, the number of interactions between the compound and the PTP sulfenyl 
amide may be chosen so as to optimise the binding of the compound to the PTP 
sulfenyl amide. For Mode 1 inhibitor compounds, in one embodiment, it is 
preferred to maximise the number of interactions between the compound and the 
first and/or second and/or third binding sites so as to provide enhanced binding to 
the PTP sulfenyl amide. 

For example, where the compound is designed to bind to the first binding site, it is 
preferred that the compound makes polar interactions with at least seven, more 
usually at least ten, and preferably at least twelve of the residues (1) to (17) and 
hydrophobic interactions with at least two and more preferably at least four of the 
residues (18) to (27). 
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Where the compound is designed to bind to the second binding site, it is preferred 
that the compound forms polar interactions with at least two, more usually three, 
and preferably four of the residues (44), (14), (45), (46), (5), (4), (1), (47), (48), 
(49) and (50), and hydrophobic interactions with at least one or two of the residues 
5 (5 1), (14), (41), (35), (4), (39), (52), (53), (42), (54), (55) and (56). 

Where the compound is designed to bind to the third binding site, it is preferred that 
the compound forms polar interactions with at least two, more usually three, and 
preferably at least four of the residues (3), (29), (30), (31), (14), (33) and (34), and 
1 0 preferably at least two and more usually at least three hydrophobic interactions with 
the residues (35), (36), 37), (38), (39), (40), (41), (42) and (43). 

In one embodiment of the invention, the compound forms interactions at two or 
more of the first, second and third binding sites, for example, (i) the first and second 
binding sites, or (ii) the first and third, or (iii) the second and third binding sites. 

15 In another embodiment, the compound forms interactions with only the first binding 
site. 

In a further embodiment, the compound forms interactions with only the second 
binding site. 

In a further embodiment, the compound forms interactions with only the third 
20 binding site. 

In a still further embodiment, the compound forms interactions with all three 
binding sites. 

A compound is considered to have formed an interaction with a given residue at a 
binding site if the proximity between a compound or portions thereof to the 
25 molecule or portions thereof wherein the juxtaposition is energetically favored by 
electrostatic or van der Waals interactions. The distance will depend on the type of 
interaction made-hydrogen bond, salt bridge or stacking interaction 
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The term hydrogen bond refers to a favorable interaction that occurs whenever a 
suitable donor atom, Q x , bearing a proton, H, and a suitable acceptor atom, Q Y , 
have a separation of < 3.5A and where the angle Q X -H-Q Y is greater than 90 
degrees. Sometimes, a single proton on a donor atom Q x may form a plurality of 
5 suitable acceptor atoms, Q Y . For example, the proton on a -NH-group may form a 
separate hydrogen bond with each of the two oxygen atoms in a carboxylate anion. 
Suitable donor and acceptor atoms are well understood in medicinal chemistry 
(G.C. Pimentel and A.L. McClellan, The Hydrogen Bond, Freeman, San Francisco, 
1960; R. Taylor and 0. Kennard, Hydrogen Bond Geometry in organic Crystals, 
10 Accounts of Chemical Research, 17, pp. 320-326 (1984)). 

The term "hydrogen bonding moiety" refers to a chemical structure containing one 
or more suitable hydrogen bond donor moieties or hydrogen bond acceptor 
moieties. 

The term "hydrogen bonding donor moiety" refers to a chemical structure 
1 5 containing a suitable hydrogen bond donor atom bearing one or more protons. 
Examples of donor atoms having one proton are -OH, -SH and -NH-. 

Examples of donor atoms having more than one proton are -NH 2 , [-NH 3 ]- and [- 
NH 2 1+ 

The term hydrogen bonding acceptor moiety refers to a chemical structure 
20 containing a suitable hydrogen bond acceptor atom. Examples of acceptor atoms 
include fluorine, oxygen, sulfur and nitrogen. 

The term stacking interaction refers to the favorable attractive interactions between 
two aromatic ring systems, wherein the two rings are juxtaposed such that they are 
oriented either face-to-face, perpendicular or at an intermediate angle to each other. 

25 Face-to-face stacking interactions are usually between 3.5-4.5 Angstrom. Face-edge 
stackings are usually to be between 3.5 and 4 A. Most aromatic protein interactions 
involve separation distances of 3.6 to 3.8 Angstrom. Fully stacked interactions are 
not usually observed. Most common are staggered stacked structures with tilted 
rings. Perpendicular stacking may be face-edge or cogwheel. Both are common. 
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(Protein-protein recognition via side-chain interactions; Thornton et al.; 
Biochemical society transactions.; 927-930 (1988)). 

The term salt bridge refers to the non-covalent attractive interaction between a 
positively charged moiety (P) and a negatively charged moiety (N) when the 
distance between the centers of mass of P and N is between 2 and 6 Angstroms. In 
calculating the center of mass, atoms which may contain a formal charge and atoms 
immediately adjacent to these are included. For example, a salt bridge may be 
formed between the positively charged guanidinium side chain of an arginine 
residue and the negatively charged carboxylate side chain of a glutamate residue. 
Salt bridges are well known in medicinal chemistry (L. Stryer, Biochemistr , 
Freeman, San Francisco, (1975); K.A. Dill, Dominant Forces in Protein Folding, 
Biochemistry, 29, No. 31, pp. 7133- 7155, (1990)) . 

The term center of mass refers to a point in three-dimensional space that represents 
a weighted average position of the masses that make up an object. 

In order to form a desired number of binding interactions with the first binding site, 
the compound preferably has a binding domain no longer than about 35 Angstrom 
long (the length of the first binding groove). Typically compounds will be, for 
example, 5-30, 5-25, 5-15, 10-15 A in length. 

In order to form a desired number of binding interactions with the second binding 
site, the compound preferably has a binding domain that can fit into an area of 
about 30 by 30 Angstrom (the area of the second binding site): In order optimise 
interactions at mis binding site, compounds will typically be, for example, 5-30, 5- 
25, 5-15, 10-15 A in length. 

The third binding site, the cavity underneath the cradle, is about 9 by 1 5 Angstrom 
so only small binding domains or small molecules will fit in this cavity, for 
example those that are 5-15, 10-15 A in length. 

A compound of the invention can possess only a single binding domain or can have 
binding domains enabling it to bind to two or three binding sites. 
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Compounds exhibiting Type 2 or Type 3 activity typically have a binding domain 
enabling them to bind to the first and/or second and/or third binding sites so as to 
bring the nucleophilic group into reactive proximity of the sulfenyl amide group. 
The nucleophilic groups can form part of the scaffold described above or can take 
5 the form of substituents attached to the scaffold. 

Screening of compounds 

In another aspect, in place of in silico methods, high or low throughput screening of 
compounds to select compounds with binding activity may be undertaken, and 
those compounds which show binding activity may be selected as possible 
10 candidate modulators, and further crystallized with PTP1B sulfenyl amide (e.g. by 
co-crystallization or by soaking) for X-ray analysis. The resulting X-ray structure 
may be compared with that of Table 1 or Table 2 for a variety of purposes. The 
screen may utilise any of the assays detailed below. 

Assays for Screening for Active Compounds 

15 Compounds may be identified in high-throughput or low-throughput screening as 
outlined above, utilizing the assays detailed below. Compounds screened may 
include those available from commercially available sources, compounds generated 
by standard synthetic chemistry methods, or those that are part of a corporate 
compound collection. 

20 Alternatively, once a candidate inhibitor compound has been identified, for example 
by computer based rational drug design techniques as described above, the 
compounds are synthesized and tested. Whether or not the compounds are 
inhibitors of the PTP sulfenyl amide can be determined by one of a number of 
assays. Consequently, all the methods of compound design and identification 

25 above (e.g. in silico analysis, ligand-sulfenyl amide PTP structure determination 
etc) preferably further comprise the further steps of: 

obtaining or synthesising the candidate modulator; and 
contacting the candidate modulator with PTP1B sulfenyl amide to determine 
the ability of the candidate modulator to interact with PTP1B sulfenyl amide. 
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For example, in one assay, the oxidized form (sulfenyl amide form) of a 
recombinant or extracted protein tyrosine phosphatase is incubated with a candidate 
binding compound and a determination is made as to whether the compound is able 
to interact with the oxidized (sulfenyl amide) form of the protein tyrosine 
5 phosphatase. 

Such assays require the formation of the oxidized form of a protein tyrosine 
phosphatase. The oxidized form can be produced by incubating the protein tyrosine 
phosphatase in the presence of oxidizing agents such as a reactive oxygen species in 
a cellular environment 24 ' 25 , organic peracids e.g. MCPBA, peroxides e.g. hydrogen 
10 peroxide 24 ' 25 or compound(s) as described in the examples below. 

The assay can be a binding assay. Such a binding assay can be competitive or non- 
competitive and can accommodate the screening of a large number of compounds to 
determine if the compounds are capable of binding to the oxidized protein tyrosine 
phosphatase. Subsequently other assays can be carried out with compounds found 
15 to bind to determine the mode of binding of these compounds. 

Alternatively, the assay can be a functional assay that identifies compounds that 
trap the oxidized form of the protein tyrosine phosphatase and so change its ability 
to regain its functional activity on reduction. Such an assay can involve incubating 
potential trapping compounds with the oxidized form of the protein tyrosine 
20 phosphatase and determining if protein tyrosine phosphatase activity can be 
regained upon reduction. 

In a further alternative, the assay can be cell-based assay for identifying compounds 
which modulate the cell-based activity of a protein tyrosine phosphatase, through 
binding to the oxidized form of protein tyrosine phosphatases in cells. 

25 Particularly preferred types of assays include binding assays, functional assays and 
cell-based assays, which may be performed as follows: 
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Binding assays 

Purified, oxidized protein tyrosine phosphatases can be used for binding studies. 
Oxidized protein tyrosine phosphatase can be used in conventional filter-binding 
assays or in a high throughput scintillation proximity-type binding assay to detect 
5 binding of a radio-labelled ligand and its displacement by compounds which 
compete for the binding site. Radioactivity can be measured with a Packard 
Topcount or similar instrumentation capable of making rapid measurements for 96-, 
384- or 1536-well microtitre plate formats. 

Binding to oxidized protein tyrosine phosphatases could also be measured using a 
10 fluorescently labelled ligand, which could be displaced by compounds, competing 
for the binding site. Binding could be detected by fluorescent polarisation methods 
using an instrument such as the Packard Fusion reader to monitor fluorescence in 
96-, 384 or 1536-well microtitre plate formats. 

Another method for studying the binding of compounds to the oxidized protein 
15 tyrosine phosphatase makes use of a surface plasmon resonance effect, measured by 
a Biacore instrument. Oxidized protein tyrosine phosphatase could be attached to 
the biosensor chip of a Biacore and binding of test compounds could be monitored. 
Examples of the use of the surface plasmon resonance effect may be found in 
Parsons et al (1995) Nucleic Acids Res. 23, 21 1-216 and Parsons et al (1997) Anal 
20 Biochem.254(l), 82-87. 

Binding of test compounds to an oxidized protein tyrosine phosphatase could also 
be monitored using NMR techniques. The difference between NMR spectra of a 
test compound with and without the oxidized protein tyrosine phosphatase could be 
analysed to determine if the compound bound the protein tyrosine phosphatase. 
25 Competition between test compounds and a known ligand for a binding site on the 
protein tyrosine phosphatase could also be monitored in this way. 

Preferred compounds are those that have a Kd value of less than ImM, more 
preferably less than luM and most preferably less than lOOnM. The term "Kd" is 
used herein in its normal sense to mean the dissociation constant, which describes 
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the ratio of the concentrations at equilibrium between the free individual 
components and the complex formed. For a complex between two components A 
andB, Kd = [A][B]/[AB]. 

Functional assays 

The oxidized form of a protein tyrosine phosphatase is inactive, but can be 
reactivated by reduction. An example of a functional assay to monitor compounds 
that trap the oxidized form of the protein tyrosine phosphatase could involve 
measuring the time taken for an oxidized form of a protein tyrosine phosphatase to 
regain activity in the presence of the candidate inhibitor under reducing conditions. 

Candidate trapping compounds could then be screened against the oxidized form of 
the enzyme, by incubating candidate inhibitors and enzyme for a period of time and 
then adding a reducing agent such as DTT or glutathione. The activity of the 
enzyme could then be monitored at time intervals after this addition of reducing 
agent. Enzyme which has been trapped in the oxidized form by candidate inhibitors 
should take longer to regain activity than enzyme that has not been trapped. Time 
taken to regain activity could be measured against controls containing no test 
compound and so correlated to potency of the inhibitor. 

Assays for monitoring the activity of protein tyrosine phosphatases have been 
described in the literature and can use known substrates such as p-nitrophenyl 
phosphate or phosphorylated peptides (Hoppe et al., 1994, Eur. J. Biochem., 223, 
1069-1077; Bleasdale et al., 2001, Biochemistry, 40, 5642-5654; Wang et al., 1999, 
Biochim Biophys Acta, 1431, 14-23). Assays can be performed in 96- or 384- well 
microtitre format using a Molecular Devices Spectramax plate reader, allowing 
screening of a large number of compounds. The dephosphorylation of p- 
nitrophenyl phosphate by a FTP can be monitored be an increase in absorbance at 
405nm. The dephosphorylation of a phosphorylated peptide can be monitored by 
measuring phosphate release by the malachite green method. 
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Cell-based Assays 

Compounds can also be screened in cell-based assays, specific to the protein 
tyrosine phosphatase of interest. 

Compounds that affect PTP1B can be screened using assays that monitor the effects 
5 of insulin on cells. Examples of such assays are as follows: 

Cells such as 3T3-L1 can be differentiated into adipocytes and induced to be insulin 
resistant. Effect of compounds on glucose transport into these cells can be 
monitored by measuring the rate of uptake of 2-[ 3 H]deoxyglucose when the cells 
are stimulated by insulin. 

10 Another cell-based assay that can be used to monitor effects of compounds on 

ptplb is an assessment of insulin receptor tyrosine kinase activity. In this case the 
tyrosine kinase activity of the insulin receptor captured from cells, e.g. 3T3-L1, 
treated with test compounds is measured. Tyrosine kinase activity can be measured 
using a peptide substrate and [y 33 P]-ATP. 

15 A further assay involves monitoring the tyrosine phosphorylation of insulin 
signalling molecules in cells that have been treated with insulin and test 
compounds. Phosphorylation of molecules can be detected for example by Western 
blotting cell extracts using monoclonal antibodies (Bleasdale et al., 2001, 
Biochemistry, 40, 5642-5654). 

20 Preferred compounds of the invention are those that have IC50 values in a cellular 
assay of less than luM, more preferably less than lOOnM and most preferably less 
than lOnM. 

Pharmaceutical Formulations 

The compounds of the invention can be presented in the form of pharmaceutical 
25 compositions. 

In another aspect, therefore, the invention provides a pharmaceutical composition 
comprising a compound that binds to protein tyrosine phosphatase sulfenyl amide 
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to prevent or inhibit conversion of the protein tyrosine phosphatase sulfenyl amide 
to an active reduced form of the protein tyrosine phosphatase. 

The pharmaceutical compositions can be in any form suitable for oral, parenteral, 
topical, intranasal, intra-articular, ophthalmic, otic, rectal, intra-vaginal, or 
transdermal administration. Where the compositions are intended for parenteral 
administration, they can be formulated for intravenous, intramuscular or 
subcutaneous administration. 

Pharmaceutical dosage forms suitable for oral administration include tablets, 
capsules, caplets, pills, lozenges, syrups, solutions, powders, granules, elixirs and 
suspensions, sublingual tablets, wafers or patches and buccal patches. 

Pharmaceutical compositions containing compounds of the invention can be 
formulated in accordance with known techniques, see for example, Remington's 
Pharmaceutical Sciences, Mack Publishing Company, Easton, PA, USA. 

Thus, tablet compositions can contain a unit dosage of active compound together 
with an inert diluent or carrier such as a sugar or sugar alcohol, eg; lactose, sucrose, 
sorbitol or mannitol; and/or a non-sugar derived diluent such as sodium carbonate, 
calcium phosphate, calcium carbonate, or a celluloses or derivative thereof such as 
methyl cellulose, ethyl cellulose, hydroxypropyl methyl cellulose, and starches such 
as corn starch. Tablets may also contain such standard ingredients as binding and 
granulating agents agents such as polyvinylpyrrolidone, disintegrants (e.g. 
swellable crosslinked polymers such as crosslinked carboxymethylcellulose), 
lubricating agents (e.g. stearates), preservatives (e.g. parabens), antioxidants (e.g. 
BHT), buffering agents (for example phosphate or citrate buffers), and effervescent 
agents such as citrate/bicarbonate mixtures. Such excipients arc well known and do 
not need to be discussed in detail here. 

Capsule fonnulations may be of the hard gelatin or soft gelatin variety and can 
contain the active component in solid, semi-solid, or liquid form. Gelatin capsules 
can be formed from animal gelatin or synthetic or plant derived equivalents thereof. 
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The solid dosage forms (eg; tablets, capsules etc.) can be coated or un-coated, but 
typically have a coating, for example a protective film coating (e.g. a wax or 
varnish) or a release controlling coating. The coating (e.g. a Eudragit ™ type 
polymer) can be designed to release the active component at a desired location 
within the gastro-intestinal tract. Thus, the coating can be selected so as to degrade 
under certain pH conditions within the gastrointestinal tract, thereby selectively 
release the compound in the stomach or in the ileum or duodenum. 

Instead of, or in addition to, a coating, the drug can be presented in a solid matrix 
comprising a release controlling agent, for example a release delaying agent which 
may be adapted to selectively release the compound under conditions of varying 
acidity or alkalinity in the gastrointestinal tract. Alternatively, the matrix material 
or release retarding coating can take the form of an erodible polymer (e.g. a maleic 
anhydride polymer) which is substantially continuously eroded as the dosage form 
passes through the gastrointestinal tract. 

Compositions for topical use include ointments, creams, sprays, patches, gels, 
liquid drops and inserts (for example intraocular inserts). Such compositions can be 
formulated in accordance with known methods. 

Compositions for parenteral administration are typically presented as sterile 
aqueous or oily solutions or fine suspensions, or may be provided in finely divided 
sterile powder form for making up extemporaneously with sterile water for 
injection. 

Examples of formulations for rectal or intra-vaginal administration include 
pessaries and suppositories which may be, for example, formed from a shaped 
moldable or waxy material containing the active compound. 

Compositions for administration by inhalation may take the form of inhalable 
powder compositions or liquid or powder sprays, and can be administrated in 
standard form using powder inhaler devices or aerosol dispensing devices. Such 
devices are well known. For administration by inhalation, the powdered 
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formulations typically comprise the active compound together with an inert solid 
powdered diluent such as lactose. 

The compounds of the invention will generally be presented in unit dosage form 
and, as such, will typically contain sufficient compound to provide a desired level 
5 of biological activity. For example, a formulation intended for oral administration 
may contain from 0.1 milligrams to 2 grams of active ingredient, more usually from 
10 milligrams to 1 gram, for example, 50 milligrams to 500 milligrams. 
The active compound will be administered to a patient in need thereof (for example 
a human or animal patient) in an amount sufficient to achieve the desired 
10 therapeutic effect. 

In another aspect, the invention provides a method of preparing a composition 
comprising (a) identifying a the PTP sulfenyl amide modulator molecule (which 
may thus be termed a lead compound) by a method of any one of the other aspects 
of the invention disclosed herein and admixing the molecule with a carrier. 

1 5 Also provided is a method of preparing a composition comprising (a) identifying a 
the PTP sulfenyl amide modulator molecule (which may thus be termed a lead 
compound) by a method of any one of the other aspects of the invention disclosed 
herein and admixing an optimised structure of the modulator molecule with a 
carrier. 

20 The invention further provides a process for producing a medicament, 

pharmaceutical composition or drug, the process comprising: (a) identifying a the 
PTP sulfenyl amide modulator molecule (which may thus be termed a lead 
compound) by a method of any one of the other aspects of the invention disclosed 
herein; and (b) preparing a medicament, pharmaceutical composition or drug 

25 containing the modulator molecule. 

A further aspect of the present invention provides a method for preparing a 
medicament, pharmaceutical composition or drug, the method comprising: 
(a) identifying a the PTP sulfenyl amide modulator molecule (which may thus be 
termed a lead compound) by a method of any one of the other aspects of the 
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invention disclosed herein; (b) optimising the structure of the modulator molecule; 
and (c) preparing a medicament, pharmaceutical composition or drug containing the 
optimised modulator molecule. 

The above-described processes of the invention may be iterative in that the 
modified compound may itself be the basis for further compound design. Detailed 
structural information can be obtained about the binding of the candidate modulator 
to PTP sulfenyl amide , and in the light of this information adjustments can be made 
to the structure or functionality of the candidate modulator, e.g. to improve binding 
to the binding cavity or cavities. The above steps may be repeated and re-repeated 
as necessary. 

By "optimising the structure" we mean e.g. adding molecular scaffolding, adding or 
varying functional groups, or connecting the molecule with other molecules (e.g. 
using a fragment linking approach) such that the chemical structure of the 
modulator molecule is changed while its original modulating functionality is 
maintained or enhanced. Such optimisation is regularly undertaken during drug 
development programmes to e.g. enhance potency, promote pharmacological 
acceptability, increase chemical stability etc. of lead compounds. 

Modifications typically will be those conventional in the art known to the skilled 
medicinal chemist, and will include, for example, substitutions or removal of 
groups containing residues which interact with the amino acid side chain groups of 
a the PTP sulfenyl amide structure of the invention. For example, the replacements 
may include the addition or removal of groups in order to decrease or increase the 
charge of a group in a test compound, the replacement of a charge group with a 
group of the opposite charge, or the replacement of a hydrophobic group with a 
hydrophilic group or vice versa. It will be understood that these are only examples 
of the type of substitutions considered by medicinal chemists in the development of 
new pharmaceutical compounds and other modifications may be made, depending 
upon the nature of the starting compound and its activity. 

Methods of Treatment 
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It is envisaged that the compounds of the invention will be useful in the prophylaxis 
or treatment of a range of disease states or conditions mediated by protein tyrosine 
phosphatases. Examples of such disease states and conditions are set out above and 
include the treatment of cancers, diabetes (diabetes type I and II) obesity, 
5 autoimmune diseases, acute and chronic inflammation, rheumatoid arthritis, 
osteoporosis, proliferative disorders including various forms of cancer, growth 
disorders and hypertension 

Compounds of the invention are generally administered to a subject in need of such 
administration, for example a human or animal patient, preferably a human. 

10 The compounds will typically be administered in amounts that are therapeutically 
or prophylactically useful and which generally are non-toxic. However, in certain 
situations (for example in the case of life threatening diseases), the benefits of 
administering a compound of the invention may outweigh the disadvantages of any 
toxic effects or side effects, in which case it may be considered desirable to 

15 administer compounds in amounts that are associated with a degree of toxicity. 

A typical daily dose of the compound can be in the range from 100 picograms to 
100 milligrams per kilogram of body weight, more typically 10 nanograms to 10 
milligrams per kilogram of bodyweight although higher or lower doses may be 
administered where required. Ultimately, the quantity of compound administered 
20 will be commensurate with the nature of the disease or physiological condition 
being treated and will be at the discretion of the physician. 

The compounds of the invention can be administered as the sole therapeutic agent 
or they can be administered in combination therapy with one of more other 
compounds for treatment of a particular disease state, for example a neoplastic 
25 disease such as a cancer as hereinbefore defined. 

EXAMPLES 

The invention will now be illustrated in greater detail by reference to the specific 
embodiments described in the following non-limiting examples. 



EXAMPLE 1 
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PTP1B Expression, Purification. Crystallisation and Structure Determination 

Expression, purification and crystallisation of the catalytic domain of PTP1B 
(residues 1-321) were based on literature conditions - see Barford et al. 20 , the 
entire disclosure in which is incorporated herein by reference. 

5 PTP1B Expression. Purification and Crystallization Protocol 

Using the DNA sequence of human PTP1B (Genbaxik nm_002827), a fragment 
encoding the N-terminal 321 residues was generated and cloned into the expression 
vector Pet 19b (Novagen) at the Ncol site enabling the initiation of translation at 
Metl. Primers used to generate the plasmid were: 

10 S-TTTTCCATGGAGATGGAAAAGGAGTTCG-S' (SEQ.ID. NO: 1) 

S^TTTTCCATGGCTAATTGTGTGGCTCCAGGATTCG-S 1 . (SEQ.ID. NO: 2) 

E.coli bl21 (de3) cells transformed with Petl9b-PTP1B were grown overnight at 
37°C in LB medium plus 100|j,g ampicillin/ml. Typically, lOmls of this overnight 
culture was used to inoculate 1 litre of LB plus 100p,g ampicillin/ml. Cultures were 
15 grown at 37°C for 3 hours prior to induction by addition of isopropyl-thio-p-d~ 
galactopyranoside (IPTG) to a final concentration of ImM. The cultures were 
grown for a further 3 hours before being harvested by centrifugation. 

All purification steps were performed at 4°C and all buffers unless stated otherwise 
contained a 1/1000 dilution of protease inhibitor cocktail III (Calbiochem). 

20 Bacterial pellets were resuspended on ice in 20mM imidazole, pH7.5, ImM EDTA, 
3mM DTT, 10% (v/v) glycerol and lysed by sonication (2 mins, 20 second pulses). 
The lysed cells were incubated with DNAase 1 (Sigma) for 10 minutes at 4°C. 
Following this, the lysate was clarified by centrifugation at 25,000 rpm for 30 
minutes. Protein was applied to a Q-sepharose fast flow column incorporated into 

25 an Akta fplc system (Amersham Biosciences) at a flow rate of 4ml/min. a linear salt 
gradient (0-0.5m NaCl in 20mM imidazole, pH7.5, ImM EDTA, 3mM DTT, 10% 
(v/v) glycerol) was applied to the column and PTP1B eluted at ~300mM NaCl. 
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Fractions containing PTP1B were pooled and buffer exchanged using a 26/10 
desalting column (Amersham Biosciences) pre-equilibrated in 25mM NaH2P0 4 pH 
6.5, ImM EDTA, 3mM DTT, 10% (v/v) glycerol. PTP1B fractions were applied to 
a 10/10 mono s column previously equilibrated in 25mM NaH 2 P0 4 pH 6.5, ImM 
5 EDTA, 3mM DTT, 1 0% (v/v) glycerol Protein fractions were eluted by applying a 
linear salt gradient (0-0.5m NaCl) to the column with PTP1B eluting at ~75mM 
NaCl. PTP1B was buffer exchanged into lOmM Tris pH7.5, 25mM NaCl, 0.2mM 
EDTA and 3mM DTT. Protein purity was assessed by SDS PAGE and was 
observed to be >95% pure. This protein was subsequently concentrated to lOmg/ml 
10 and used for crystallisation. 

Using the hanging drop method ( Crystallization of nucleic acids and proteins. A 
practical approach. A. Ducruix and R. Giege. Oxford University press 1999. and 
Principles of Protein X-ray Crystallography. Jan Drenth. Springer Verlag 1994.), 
PTP1B crystals were grown at 4°C from 4yl drops (1:1 ratio proteinrreservoir 
15 solution) and equilibrated against 1ml reservoir solution consisting of 12-18% (v/v) 
PEG 4000, 0.1 m HEPES (pH 7.5) and 0.2 M magnesium acetate. 

Method for assaying PTP1B activity: 

PTP1B enzyme activity was assayed by measuring the dephosphorylation of p- 
nitrophenyl phosphate to p-nitrophenol. The reaction was monitored by following 

20 the increase in absorbance at 405 nm as p-nitrophenol was produced. Standard 
assays typically contained 0.25 mM p-nitrophenyl phosphate substrate and 25 nM 
PTP1B enzyme in 50 mM HEPES, pH 6.5, 1 mM DTT, 1 mM EDTA, 0.01% 
CHAPS buffer. Assays were carried out in Costar 3696 half-area plates in an assay 
volume of lOOul on a Spectramax plate reader (Molecular Devices). Reactions 

25 were typically monitored at 20 second intervals for 30 minutes. For kinetic 

measurements of km the concentration of the p-nitrophenyl phosphate substrate was 
varied between 3 1 uM and 5 mM. 

Activity of PTP1B was also monitored by measuring the phosphate produced in the 
reaction. Standard assays were set up as described above, except that 50ul of 
30 malachite green reagent was added to quench the reaction and the absorbance at 



WO 2004/087905 



80 



PCT/GB2004/001446 



620 iun was then measured on a Spectramax plate reader (Molecular Devices) to 
determine the amount of phosphate produced. Malachite green reagent is made 
from 0.2% w/v malachite green and 4.2% w/v ammonium molybdate mixed in a 
ratio of 3:1. 

5 For overnight soaking experiments crystals were transferred into a standard mother 
liquor containing 16% PEG 4000, 0.1 M HEPES (pH 7.5), 0.2 M magnesium 
acetate, 10 mM DTT and the compound of interest (See Table 3 below). 

All data were collected at 100 K. Data sets of sulfenic- sulfinic- and sulfonic acid 
PTP1B derivatives were automatically collected on a Jupiterl40 CCD detector 
10 mounted on a RU3HR rotating anode generator equipped with an ACTOR sample- 
changing robot (RigakuMSC, The Woodlands Tx, USA). They were processed and 
scaled using D*TREK 21 and converted to structure factors using programs from the 
CCP4 suite 22 . 

Oxidation of Cys215 to the sulfenyl-amide derivative was carried out by soaking 
15 crystals for approximately 24 hours in mother liquor without DTT, but containing 
100 mM 2-phenyl-isoxazolidine-3,5-dione. Data were collected at station 14.1 
(SRS, Daresbury) and processed and scaled using MOSFLM/SCALA 22 . 

Reversibility of the sulfenyl-amide derivative was checked by first soaking two 
crystals in 100 mM 2-phenyl-isoxazolidine-3,5-dione for 24 hours. Sulfenyl-amide 
20 formation was confirmed by an in-house data set of one of the crystals. After back 
soaking for 24 hours in mother liquor containing 20 mM reduced glutathione, data 
collected from the other crystal confirmed the active site in its native conformation. 

All results were evaluated using the graphics programs QUANTA (Accelrys, San 
Diego CA, USA) and Astexviewer™* 23 . Data collection statistics are summarised 
25 in Table 2. Initial refinement was always carried out using the CCP4-based Astex 
automatic refinement scripts, followed by rounds of positional and B-factor 
refinement with REFMAC5 22 alternated with manual rebuilding steps using 
QUANTA. 
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Monomer libraries for the different oxidation states were generated using 
REFMAC5 in combination with the monomer sketcher of the CCP4GUI 22 . 



Table 2: Crystallographic data collection and refinement statistics 




Sulfenic 
acid 


Sulfmic 
acid 


Sulfonic 
acid 


Sulfenyl- 
amide 


Back soaking experiment 


Compound 


Glutathione 


Data collection 
Beamline 


In-house 


Ih-house 


In-house 


SRS 14.1 


In-house 


In-house 


MA) 


1.54 


1.54 


1.54 


1.488 


1.54 


1.54 


Resolution (A) 


2.3 


2.6 


2.2 


2.2 


2.4 


2.2 


No. Observations 


43942 


29624 


53452 


83434 


103350 


108793 


No. Unique 
reflections 


20755 


16237 


23367 


23591 


18514 


23322 


Completeness 
(%) * 


96.0 
(94.9) 


95.9 
(95.3) 


96.0 
(86.9) 


98.9 
(97.4) 


99.8 (100) 


100 
(100) 


Emerge 1 " 


0.092 
(0.25) 


0.172 
(0.37) 


0.060 
(0.252) 


0.050 
(0.26) 


U.U /o 

(0.273) 


(0.32) 


I/a<I>* 


7.3 (2.8) 


4.6 (1.9) 


11.2(4.3) 


7.8 (2.8) 


8.4 (2.8) 


12.9 
(2.2) 


Refinement 
Rcryst/Rfree 


0.187/ 
0.248 


0.204/ 
0.263 


0.202/ 
0.242 


0.243/ 
0.283 


0.215/ 
0.275 


0.228/ 
0.277 


Rmsd Bond 
lengths (A) 


0.013 


0.018 


0.012 


0.009 


0.006 


0.006 


Rmsd Bond 
angles (°) 


1.4 


1.7 


1.3 


1.4 


1.3 


1.3 
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♦Numbers in parentheses indicate the highest shell values 

.'Rmerge = S h 2i|I(h,i)-<I>(h)|2hSi<I>(h); I(h,i) is the scaled intensity of the ith 
observation of reflection h and <I>(h) is the mean value. Summation is over all 
measurements. 

5 Rcryst =£hki 1 work||Fobs|-k|F C aic||/£hkljF ob sI, where Fobs and Fcalc are the observed and 
calculated structure factors, k is a weighting factor and work denotes the working 
set of 95% of the reflections used in the refinement. 

Rfree=Shki,test||Fobs|-k|F C a]ci|/£hki|Fobs|, where Fobs and Fcalc are the observed and 
calculated structure factors, k is a weighting factor and test denotes the test set of 
1 0 5% of the reflections used in cross validation of the refinement. 
X refers to wavelength, Rmsd to root mean square deviations. 



Composition of cocktails used in soaking experiments: 

Cocktail composition used in the soaking experiment resulting in the sulfenic acid 
(Cys-SOH) PTP1B derivative: 



TABLE 3 



Compou nds Used in Soaking Experiments 



Structure of 2-Phenyl-isoxazoUdine-3,5-dione: 





F 






H 
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Cocktail composition used in the soaking experiment resulting in the sulfinic acid 
(Cys-S02H) PTP1B derivative: 




Cocktail composition used in the soaking experiment resulting in the sulfonic acid 
(Cys-S03H) PTP1B derivative: 




PHARMACEUTICAL FORMULATIONS 

EXAMPLE 2 
5 (i) Tablet Formulation 

A tablet composition containing a compound of the invention is prepared by mixing 
50mg of the compound with 197mg of lactose (BP) as diluent, and 3mg 
magnesium stearate as a lubricant and compressing to form a tablet in known 
manner. 

0 

fii) Capsule Formulation 

A capsule formulation is prepared by mixing lOOmg of a compound of the 
invention with lOOmg lactose and filling the resulting mixture into standard opaque 
hard gelatin capsules. 



15 
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Equivalents 

It will readily be apparent that numerous modifications and alterations may be made 
to the specific embodiments of the invention described above without departing 
from the principles underlying the invention. All such modifications and alterations 
5 are intended to be embraced by this application. 
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TABLE 1 

Unit cell dimensions: 

a - 87.686 A, b « 87.686 A, c = 103.721 A, a = 90.00°, p «= 90. 00°, 
Y - 120.00° 



Space group: 
P3 X 2 1 
Table 1 
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159 65.664 24.081 

159 64.144 25.601 

160 62.195 21.197 
160 62.685 19.953 
160 63.812 19.485 
160 63.610 19.381 
160 61.511 18.967 
160 61.364 17.751 
160 59.970 17.156 

160 61.552 18.064 

161 64.997 19.221 
161 66.130 18.742 
161 66.536 17.335 
161 66.820 17.090 
161 67.339 19.683 
161 68.406 19.487 
161 69.724 20.166 
161 69.708 21.310 

161 70.785 19.552 

162 66.548 16.405 
162 67.179 15.107 
162 68.695 15.304 
162 69.323 15.397 
162 66.775 14.119 
162 67.313 12.711 
162 68.520 12.511 

162 66.413 11.728 

163 69.267 15.380 
163 70.678 15.724 
163 71.705 14.880 
163 72.784 15.382 
163 71.040 15.688 
163 70.544 16.860 
163 71.139 16.782 

163 70.853 18.219 

164 71.381 13.619 
164 72.294 12.743 
164 72.340 13.146 
164 73.420 13.347 
164 71.869 11.255 
164 72.214 10.768 

164 72.671 10.347 

165 71.155 13.280 
165 71.001 13.529 
165 70.880 15.016 
165 70.856 15.342 
165 69.759 12.769 
165 68.565 13.344 

165 69.733 11.322 

166 70.801 15.900 
166 70.591 17.348 
166 69.306 17.803 
166 69.146 19.003 
166 71.836 18.000 
166 73.145 17.762 
166 74.080 18.970 
166 74.818 19.185 
166 74.052 19.751 
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68.390 16.881 42.226 1.00 57.39 

67.062 17.257 41.692 1.00 57.78 

66.280 18.111 42.705 1.00 53.55 

66.476 17.992 43.914 1.00 51.23 

66.178 16.035 41.387 1.00 59.44 

66.720 14.998 40.420 1.00 64.57 

66.137 13.621 40.700 1.00 68.48 

64.902 13.441 40.554 1.00 71.00 

66.909 12.723 41.093 1.00 70.28 

65.383 18.948 42.200 1.00 50.37 

64.550 19.809 43.034 1.00 47.15 

63.097 19.658 42.641 1.00 45.69 

62.799 19.382 41.487 1.00 46.95 

64.922 21.271 42.843 1.00 46.52 

64.870 21.595 41.446 1.00 45.51 

66.361 21.534 43.269 1.00 46.99 

62.199 19.873 43.594 1.00 41.85 

60.769 19.945 43.309 1.00 39.92 

60.167 21.092 44.097 1.00 36.00 

60.542 21,301 45.234 1.00 36.05 

60.091 18.631 43.678 1.00 41.63 

60.529 17.453 42.803 1.00 47.97 

60.037 16.084 43.258 1.00 53.84 

59.410 15.355 42.159 1.00 61.38 

58.147 15.510 41.770 1.00 65.71 

57.685 14.806 40.743 1.00 69.78 

57.334 16.350 42.402 1.00 66.33 

59.266 21.842 43.473 1.00 31.05 

58.476 22.866 44.130 1.00 34.08 

57.255 22.215 44.822 1.00 32.33 

56.479 21.527 44.184 1.00 31.31 

58.004 23.911 43.091 1.00 38.29 

57.032 24.969 43.634 1.00 47.97 

56.421 25.896 42.573 1.00 52.73 

56.787 25.827 41.372 1.00 60.61 

55.562 26.720 42.954 1;00 55.45 

57.093 22.444 46.122 1.00 31.01 

55.911 22.019 46.866 1.00 25.68 

55.143 23.212 47.391 1.00 24.75 

55.738 24.165 47.917 1.00 27.08 

56.324 21.139 48.055 1.00 27.41 

57.228 19.983 47.605 1.00 28.28 

55.093 20.620 48.752 1.00 31.23 

56.592 19.051 46.602 1.00 30.57 

53.821 23.172 47.286 1.00 22.95 

52.987 24.192 47.915 1.00 24.79 

52.448 23.652 49.233 1.00 27.69 

51.962 22.526 49.287 1.00 27.52 

51.840 24.622 46.980 1.00 25.78 

52.298 25.213 45.637 1.00 28.93 

51.119 25.492 44.738 1.00 33.95 

53.075 26.505 45.846 1.00 33.75 

52.550 24.467 50.279 1.00 25.46 

52.099 24.163 51.619 1.00 25.40 

50.884 25.038 51.895 1.00 27.85 

50.992 26.255 51.960 1.00 25.57 

53.219 24.451 52.635 1.00 27.52 

52.962 23.882 54.003 1.00 28.74 
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63.804 37.045 61.091 1.00 55.70 

64.563 27.770 54.244 1.00 27.13 

60.616 34.916 57.253 1.00 39.66 

60.004 37.130 53.947 1.00 48.69 

68.398 28.837 52.842 1.00 34.34 

63.996 35.732 55.676 1.00 42.07 

63.949 27.947 59.239 1.00 50.17 

69.301 26.294 58.726 1.00 45.15 

65.986 19.006 60.572 1.00 55.43 

67.837 21.065 59.796 1.00 48.96 

62.227 10.934 59.636 1.00 26.45 

64.832 13.539 60.267 1.00 40.96 
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46.521 12.392 42.140 1.00 47.31 

52.916 22.239 37.026 1.00 55.49 

45.869 13.974 60.909 1.00 34.41 

48.083 16.068 66.708 1.00 33.89 

43.023 14.129 61.953 1.00 53.88 
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TABLE 2 

All atoms as in Table 1 (incorporated herein by reference) except for the following 
six changes: 

5 1) Delete lines for Atom 1770-1793 
2) Replace with: 
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3) Renumber following atoms (from ALA 217) accordingly. 

4) Insert following line after new ATOM 2297 (old ATOM 2309): 

TER 2298 GLY A 283 

5) Insert following Header to PDB file: 
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35 6) Insert following Footer instead of END line: 

CONECT 1775 1776 

CONECT 1776 1775 

CONECT 2299 2405 2309 2339 

40 CONECT 2309 2299 

CONECT 2339 2299 

CONECT 24 05 22 99 

MASTER 498 0 1 9 H o 
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45 END 
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CLAIMS 

1 . Isolated sulfenyl amide cysteine-containing protein, or a homologue, allelic 
form, species variant, derivative or mutein thereof. 

2. Isolated protein sulfenyl amide characterised by the HC(X5)R signature 
motif, or a homologue, allelic form, species variant, derivative or mutein 
thereof. 

3. Isolated PTP sulfenyl amide, or a homologue, allelic form, species variant, 
derivative or mutein thereof. 

4. A process for screening for an inhibitor of a protein (such as PTP) capable 
of forming a sulfenyl amide as defined in any one of claims 1 to 3, which 
process comprises the steps of: (a) providing a sulfenyl amide of the protein 
(e.g. PTP sulfenyl amide) (or a homologue, allelic form, species variant, 
derivative or mutein thereof); (b) contacting the sulfenyl amide of step (a) 
with a test compound; and (c) determining whether the test compound binds 
to the sulfenyl amide. 

5. A process for producing an inhibitor of a protein (such as PTP) capable of 
forming a sulfenyl amide as defined in any one of claims 1 to 3, which 
process comprises the steps of: (a) providing a sulfenyl amide of the protein 
(e.g. PTP sulfenyl amide) (or a homologue, allelic form, species variant, 
derivative or mutein thereof); (b) contacting the sulfenyl amide of step (a) 
with a test compound; (c) determining whether the test compound binds to 
the sulfenyl amide; and (d) identifying the test compound as an inhibitor 
(e.g. a PTP inhibitor) on the basis of its ability to prevent or inhibit the 
reductive activation of the sulfenyl amide (e.g. PTP sulfenyl amide) to 
active protein (e.g. PTP). 

6. The process of claim 5 wherein at least two chemically distinct test 
compounds are identified in step (d) and wherein the process further 
comprises the step of linking two or more of the chemically distinct 
compounds to produce a multimeric inhibitor. 
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The process of claim 5 or claim 6 for producing a pharmaceutical 
composition further comprising the step of: (e) incorporating the inhibitor 
identified in step (d) into a pharmaceutical excipient. 

The sulfenyl amide of any one of claims 1 to 3 which is suitable for use in 
the process of any one of claims 4 to 7. 

A protein (e.g. PTP) inhibitor obtainable by, or obtained by, the process of 
any one of claims 4 to 6. 

A pharmaceutical composition obtainable by, or obtained by, the process of 
claim 7. 

Use of a protein sulfenyl amide (e.g. PTP sulfenyl amide) for drug 
screening. 

The use of a compound for the manufacture of a medicament for the 
treatment of a disease or condition mediated by protein tyrosine 
phosphatase, wherein the compound is one that binds to protein tyrosine 
phosphatase sulfenyl amide to prevent or inhibit conversion of the protein 
tyrosine phosphatase sulfenyl amide to an active reduced form of the protein 
tyrosine phosphatase. 

A method of reducing the activity of a protein tyrosine phosphatase (PTP), 
the PTP being one which is convertible between an active form and an 
inactive form, the inactive form being characterised by the presence of a 
sulfenyl amide moiety formed at the active site of the PTP between the 
sulphur atom of a cysteine group and a backbone nitrogen atom of a 
neighbouring amino acid, whereby the sulfenyl amide moiety distorts and 
inactivates the active site of the PTP and wherein the sulfenyl amide moiety 
is disruptible to restore the inactivate form of the PTP to the active form 
thereof; 

which method comprises inhibiting disruption of the sulfenyl amide moiety, 
or modifying the sulfenyl amide moiety to prevent restoration of the inactive 
form of the PTP to the active form. 
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A method according to claim 13 wherein the sulfenyl amide moiety is 
disruptible by reaction with a reducing agent to restore the inactivate form 
of the PTP to the active form thereof. 

A method according to claim 13 or claim 14 wherein the sulfenyl amide 
moiety is disruptible to regenerate 1he cysteine group. 

A method according to claim any one of claims 1 3 to 15 which comprises 
inhibiting disruption of the sulfenyl amide moiety by means of a ligand that 
binds to the inactivated active site of the PTP. 

A method according to any one of claims 13 to 15 which comprises 
modifying the sulfenyl amide moiety to prevent restoration of the inactive 
form of the PTP to the active form. 

A method according to claim 17 which comprises reversibly modifying the 
sulfenyl amide moiety. 

A method according to claim 17 which comprises irreversibly modifying the 
sulfenyl amide moiety. 

A method according to any one of claims 17 to 19 in which the sulfenyl 
amide moiety is modified by reaction with a nucleophilic ligand. 

A method according to claim 20 wherein the sulfenyl amide moiety is 
modified by reaction with a nucleophilic ligand having a nucleophilic group 
that will react with the sulfenyl amide moiety, and a binding region for 
binding to the PTP sulfenyl amide in the region of the sulfenyl amide 
moiety. 

A method according to claim 21 wherein the nucleophilic group is selected 
from the group consisting of a thiol, disulfane, primary thioamide, 
secondary thioamide, primary thiourea, secondary thiourea, primary amine, 
secondary amine, primary hydrazine, secondary hydrazine, primary 
hydrazide, secondary hydrazide, primary hydrazone, secondary hydrazone, 
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primary amide, secondary amide, primary urea, secondary urea, primary 
sulfonamide, secondary sulfonamide, 5-membered ring heterocycle 
containing NH, alcohol, hydroxylamine, oxime, hydxoxamic acid, 
carboxylic acid (preferably other than an oxalamic acid), sulfoxide, sulfate 
and a nitrone. 

A method of identifying by rational drug design a compound capable of 
reducing the level of activity of a protein tyrosine phosphate (PIT) in a 
cellular environment, the PTP being one which is convertible in a cellular 
environment between an active form and an inactive form, the inactive form 
being characterised by the presence of a sulfenyl amide moiety formed at 
the active site of the PTP between the sulphur atom of a cysteine group and 
a backbone nitrogen atom of a neighbouring amino acid; 
which method comprises: 

(a) designing a ligand that will (i) bind to the active site in the region of the 
sulfenyl amide moiety to inhibit conversion of the inactive form back to the 
active form, or (ii) modify the sulfenyl amide moiety to inhibit conversion 
of the inactive form of the PTP to the active form; 

(b) synthesizing the ligand; and 

(c) determining whether the ligand reduces the level of activity of a protein 
tyrosine phosphate (PTP) in a cellular environment. 

A method according to claim 23 wherein the PTP is PTP IB and the ligand is 
one which is capable of binding to the sulfenyl amide PTP IB at a binding 
site as defined in any one of claims 52 to 65 

A method according to any one of the preceding claims wherein the protein 
tyrosine phosphatase is characterised by a signature sequence of the 
formula: (IAOHCXAGXXR(S/T/G) at a catalytic site thereof wherein the 
amino acid C is cysteine 215, and wherein the sulfenyl amide moiety is 
formed between the sulphur atom of cysteine 215 and a backbone nitrogen 
atom of a neighbouring amino acid. 

A crystal of sulfenyl amide protein tyrosine phosphatase IB. 
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A crystal of sulfenyl amide protein tyrosine phosphatase IB having a Unit 
cell dimensions: a = 87.686 A, b = 87.686 A, c - 103.721 A, a = 90.00°, p 
= 90.00°, y = 120.00° and a space group: P3i 2 1. 

A crystal of sulfenyl amide protein tyrosine phosphatase IB having a 
resolution better than, i.e. numerically lower than, 3.0 A. 

A crystal of sulfenyl amide protein tyrosine phosphatase IB having the 
structure defined by the coordinates of Table 1 or Table 2 ± root mean 
square deviation from the Ca atoms of not more than 1.5 A. 

A method of homology modeling comprising the steps of: (a) aligning a 
representation of an amino acid sequence of a target sulfenyl amide protein 
tyrosine phosphatase protein of unknown three-dimensional structure with 
the amino acid sequence of the sulfenyl amide protein tyrosine phosphatase 
IB of Table 1 or Table 2 to match homologous regions of the amino acid 
sequences; (b) modeling the structure of the matched homologous regions of 
said target sulfenyl amide protein tyrosine phosphatase of unknown 
structure on the corresponding regions of the sulfenyl amide protein tyrosine 
phosphatase IB structure as defined by the coordinates of Table 1 or Table 2 
± root mean square deviation from the Ca atoms of not more than 1 .5A; and 
(c) determining a conformation (e.g. so that favorable interactions are 
formed within the target sulfenyl amide protein tyrosine phosphatase of 
unknown structure and/or so that a low energy conformation is formed) for 
said target sulfenyl amide protein tyrosine phosphatase of unknown 
structure which substantially preserves the structure of said matched 
homologous regions. 

A method according to claim 30 wherein one or all of steps (a) to (c) are 
performed by computer modeling. 

A method for determining the structure of a protein, which method 
comprises; providing the co-ordinates of Table 1 or Table 2 ± root mean 
square deviation from the Ca atoms of not more than 1.5A, and either (a) 
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positioning the co-ordinates in the crystal unit cell of said protein so as to 
provide a structure for said protein or (b) assigning NMR spectra peaks of 
said protein by manipulating the coordinates of Table 1 or Table 2. 

A method according to claim 32 wherein the co-ordinates of Table 1 or 
Table 2 ± root mean square deviation from the Ca atoms of not more than 
1.5 A are used to solve the structure of a target sulfenyl amide protein 
tyrosine phosphatase, particularly homologues of sulfenyl amide protein 
tyrosine phosphatase IB for example PTP-a, T-cell PTP, or LAR. 

A system, particularly a computer system, the system containing either (a) 
atomic coordinate data according to Table 1 or Table 2 ± root mean square 
deviation from the Ca atoms of not more than 1 .5 A, said data defining the 
three-dimensional structure of sulfenyl amide protein tyrosine phosphatase 
IB or at least selected coordinates thereof; (b) structure factor data (where a 
structure factor comprises the amplitude and phase of the diffracted wave) 
for sulfenyl amide protein tyrosine phosphatase IB, said structure factor 
data being derivable from the atomic coordinate data of Table 1 or Table 2 ± 
root mean square deviation from the Ca atoms of not more than 1 .5 A; (c) 
atomic coordinate data of a target sulfenyl amide protein tyrosine 
phosphatase protein generated by homology of the target based on the data 
of Table 1 or Table 2 ± root mean square deviation from the Ca atoms of 
not more than 1.5A; (d) atomic coordinate data of a target sulfenyl amide 
protein tyrosine phosphatase protein generated by interpreting X-ray 
crystallographic data or NMR data by reference to the data of Table 1 or 
Table 2 ± root mean square deviation from the Ca atoms of not more than 
1 .5 A; or (e) structure factor data derivable from the atomic coordinate data 
of (c) or (d). 

A computer-readable storage medium, comprising a data storage material 
encoded with computer readable data, wherein the data are defined by all or 
a portion (e.g. selected coordinates as defined herein) of the structure 
coordinates of sulfenyl amide protein tyrosine phosphatase IB of Table 1 or 
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Table 2 ± root mean square deviation from the Ca atoms of not more than 
1.5 A, or a homologue of sulfenyl amide protein tyrosine phosphatase IB, 
wherein said homologue comprises backbone atoms that have a root mean 
square deviation from the backbone atoms (nitrogen-carbon a -carbon) of 
Table 1 or Table 2 of not more than 1.5 A. 

A computer-readable data storage medium comprising a data storage 
material encoded with a first set of computer-readable data comprising a 
Fourier Transform of at least a portion (e.g. selected coordinates as defined 
herein) of the structural coordinates for sulfenyl amide protein tyrosine 
phosphatase IB according to Table 1 or Table 2 =b root mean square 
deviation from the Ca atoms of not more than 1 .5A; which, when combined 
with a second set of machine readable data comprising an X-ray diffraction 
pattern of a molecule or molecular complex of unknown structure, using a 
machine programmed with the instructions for using said first set of data 
and said second set of data, can determine at least a portion of the structure 
coordinates corresponding to the second set of machine readable data. 

Computer readable media with at least one of: (a) atomic coordinate data 
according to Table 1 or Table 2 ± root mean square deviation from the Ca 
atoms of not more than 1 .5 A recorded thereon, said data defining the three- 
dimensional structure of sulfenyl amide protein tyrosine phosphatase IB, or 
at least selected coordinates thereof; (b) structure factor data for sulfenyl 
amide protein tyrosine phosphatase IB recorded thereon, the structure factor 
data being derivable from the atomic coordinate data of Table 1 or Table 2 ± 
root mean square deviation from the Ca atoms of not more than 1 .5A; (c) 
atomic coordinate data of a target sulfenyl amide protein tyrosine 
phosphatase protein generated by homology modeling of the target based on 
the data of Table 1 or Table 2 ± root mean square deviation from the Ca 
atoms of not more than 1.5A; (d) atomic coordinate data of a sulfenyl amide 
protein tyrosine phosphatase IB-ligand complex or a sulfenyl amide protein 
tyrosine phosphatase IB homologue or analogue generated by interpreting 
X-ray crystallographic data or NMR data by reference to the data of Table 1 
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or Table 2 ± root mean square deviation from the Ca atoms of not more than 
1.5 A; and (e) structure factor data derivable from the atomic coordinate data 
of(c)or (d). 

A method of providing data for generating structures and/or performing 
rational drug design for sulfenyl amide protein tyrosine phosphatase IB, 
sulfenyl amide protein tyrosine phosphatase IB homologues or analogues, 
complexes of sulfenyl amide protein tyrosine phosphatase IB with a 
candidate modulator, or complexes of sulfenyl amide protein tyrosine 
phosphatase IB homologues or analogues with candidate modulators, the 
method comprising: 

(i) establishing communication with a remote device containing 
computer-readable data comprising at least one of: (a) atomic coordinate 
data according to Table 1 or Table 2 ± root mean square deviation from the 
Ca atoms of not more than 1 ,5A, said data defining the three-dimensional 
structure of sulfenyl amide protein tyrosine phosphatase IB, at least one 
sub-domain of the three-dimensional structure of sulfenyl amide protein 
tyrosine phosphatase IB, or the coordinates of a portion of atoms of sulfenyl 
amide protein tyrosine phosphatase IB; (b) structure factor data for sulfenyl 
amide protein tyrosine phosphatase IB, said structure factor data being 
derivable from the atomic coordinate data of Table 1 or Table 2 ± root mean 
square deviation from the Ca atoms of not more than 1 .5A; (c) atomic 
coordinate data of a target sulfenyl amide protein tyrosine phosphatase IB 
homologue or analogue generated by homology modeling of the target 
based on the data of Table 1 or Table 2 ± root mean square deviation from 
the Ca atoms of not more than 1.5 A; (d) atomic coordinate data of a protein 
generated by interpreting X-ray crystallographic data or NMR data by 
reference to the data of Table 1 or Table 2 ± root mean square deviation 
from the Ca atoms of not more than 1.5 A; and (e) structure factor data 
derivable from the atomic coordinate data of (c) or (d); and 

(ii) receiving said computer-readable data from said remote device. 

A computer-based method of rational drug design which comprises: 
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providing the structure of the PTPlb sulfenyl amide as defined by 

the coordinates of Table 1 or Table 2 ± root mean square deviation from the 

Ca atoms of not more than 1 .5 A; 

providing the structure of a candidate modulator molecule; and 
fitting the structure of candidate to the structure of the sulfenyl 

amide of Table 1 or Table 2 ± root mean square deviation from the Ca 

atoms of not more than 1.5 A. 

A method of rational drug design which comprises; 

providing the structure of the PTP1B sulfenyl amide as defined by 

the coordinates of Table 1 or Table 2 ± root mean square deviation from the 

Ca atoms of not more than 1.5 A; 

providing the structure of a candidate compound; and 

fitting the structure of the candidate compound to the structure of the 

sulfenyl amide as defined by the coordinates of Table 1 or Table 2 ± root 

mean square deviation from the Ca atoms of not more than 1.5 A. 

A method of identifying by rational drug design a compound capable of 
reducing the level of activity of a protein tyrosine phosphatase (PTP) in a 
cellular environment, the PTP being one which is convertible in a cellular 
environment between an active form and an inactive or less active form, the 
inactive form or less active form being characterised by the presence of a 
sulfenyl amide moiety formed at the active site of the PTP between the 
sulphur atom of a cysteine group and a backbone nitrogen atom of a 
neighbouring amino acid; 

which method comprises: 

(a) designing a ligand that will (i) bind to the active site in the region of the 
sulfenyl amide moiety to inhibit conversion of the inactive form back to the 
active form, or (ii) modify the sulfenyl amide moiety to inhibit conversion 
of the inactive form of the PTP to the active form; 

(b) synthesizing the ligand; and 

(c) determining whether the ligand reduces the level of activity of a protein 
tyrosine phosphate (PTP) in a cellular environment. 
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A computer-based method of rational drug design which comprises: 

providing the coordinates of at least two atoms of Table 1 or Table 2 

of the PTP1B sulfenyl amide ("selected coordinates"); 

providing the structure of a candidate modulator molecule; and 
fitting the structure of candidate to the selected coordinates of the 

PTP IB sulfenyl amide. 

A method for determining the structure of a compound bound to sulfenyl 
amide PTP IB, said method comprising: (a) providing a crystal of sulfenyl 
amide PTP lb according to the invention; (b) soaking the crystal with said 
compounds; and (c) determining the structure of said sulfenyl amide PTP lb 
compound complex by employing the data of Table 1 or Table 2 ± root 
mean square deviation from the Ca atoms of not more than 1.5 A. 

A method of inhibiting or preventing the reduction of sulfenyl amide 
PTB1B to PTB1B in a cellular environment by exposing the PTB1B to a 
ligand capable of binding to the sulfenyl amide PTB1B in the region of the 
sulfenyl amide moiety so as to prevent reduction of the sulfenyl amide 
moiety by an intracellular reducing agent. 

A method of inhibiting or preventing the reduction of sulfenyl amide 
PTB1B to PTB1B in a cellular environment by exposing the PTB1B to a 
ligand capable of binding to the sulfenyl amide PTB1B in the region of the 
sulfenyl amide moiety, the ligand having a nucleophilic moiety capable of 
modifying the sulfenyl amide moiety so as to prevent its reduction by an 
intracellular reducing agent. 

A method according to claim 44 or claim 45 wherein the ligand is capable of 
binding to the sulfenyl amide PTP IB at a binding site as defined in any one 
of claims 52 to 65. 

A novel compound per se that inhibit protein tyrosine phosphatases by 
interacting with sulfenyl amide PTP to prevent or inhibit conversion of the 
PTP sulfenyl amide to an active form of the protein tyrosine phosphatase. 
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A compound according to claim 47 for use in medicine, for example for use 
in the treatment of diseases or conditions mediated by protein tyrosine 
phosphatase. 

A compound according to claim 47 or claim 48 which is a non-covalent 
binding inhibitor that stabilises the sulfenyl-amide protein form. 

A compound according to claim 47 or claim 48 which binds to and 
reversibly modifies the sulfenyl-amide form of the protein, e.g. by reacting 
with the sulfenyl amide moiety, and in so doing, preventing reactivation of 
the sulfenyl amide PTP by physiological cell cycling. 

A compound according to claim 47 or claim 48 which binds to and 
irreversibly modifies the sulfenyl-amide form of the protein, e.g. by reacting 
irreversibly with the sulfenyl amide moiety, and in so doing, preventing 
reactivation of the sulfenyl amide PTP by physiological cell cycling. 

A compound according to any one of claims 47 to 51, which compound is 
capable of binding to a first binding site of the sulfenyl amide PTP 
constituted by a groove lined by residues 41-47 of the phosphotyrosine 
recognition loop, residues 88-90, 1 15 to 120, residues 179 to 184 of the 
WPD-loop, residues 215 to 219 of the phosphate-binding cradle, and 
residues 262-266. 

A compound according to claim 52 having a molecular shape and charge 
distribution that enables it to make polar interactions at the first binding site 
with one or more of: 

(1) Lys41 

(2) Asn42 

(3) Arg45 

(4) Tyr46 

(5) Arg47 

(6) Asn90 

(7) Glnll5 
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(8) 


Lysll6 


(9) 


Serll8 


(10) 


Lysl20 


(11) 


Trpl79 


(12) 


Ser216 


(13) 


Arg221 


(14) 


Gln262 


(15) 


Thr263 


(16) 


Asp265, and 


(17) 


Gln266; 



wherein the amino acid numbering refers to the numbering of the 
corresponding active form of PTP1B. 

54. A compound according to claim 53 having a molecular shape and charge 
distribution that enables it to make polar interactions with two or more of 
moieties (1) to (17), more preferably three or more, for example four or 
more, and more particularly five or more. 

55. A compound according to any one of claims 52 to 54 having a molecular 
shape and charge distribution that enables it to make hydrophobic 
interactions with one or more of: 



(18) 


Leu88 


(19) 


Pro89 


(20) 


Leull9 


(21) 


Phel82 


(22) 


Glyl83 


(23) 


Vail 84 


(24) 


Ala217 


(25) 


Ile219 


(26) 


the apolar part of Arg221, and 


(27) 


the apolar part of Gln262. 
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A compound according to claim 55 having a molecular shape and charge 
distribution that enables it to make hydrophobic interactions with two or 
more of the moieties (18) to (27), more preferably three or more, for 
example five or more. 

A compound according to any one of claims 47 to 56, which compound is 
capable of binding to a second binding site of the sulfenyl amide PTP 
constituted by a shallow depression defined by residues of the WPD-loop, 
the pTyr recognition loop and the loop containing residues 28-32. 

A compound according to claim 57 having a molecular shape and charge 
distribution that enables it to make polar interactions at the second binding 
site with one or more of: 



(44) 


Arg24 


(14) 


Gln262 


(45) 


Arg254 


(46) 


Asn44 


(5) 


Arg47 


(4) 


Tyr46 


(1) 


Lys41 


(47) 


Lys36 


(48) 


Asp29 


(49) 


Cys32 and 


(50) 


Ser50 



59. A compound according to claim 57 or claim 58 having a molecular shape 

and charge distribution that enables it to make hydrophobic interactions with 

one or more of: 

(51) Leu250 

(14) GIn262 

(41) Met258 

(35) Val49 

(4) Tyr46 
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(39) 


Gly218 


(52) 


Gly259 


(53) 


Phe52 


(42) 


Leu260 


(54) 


Leu261 


(55) 


Ala35 and 


(56) 


the backbone of Asp48. 



60. A compound according to any one of claims 47 to 59, which compound is 
capable of binding to a third binding site in the form of a cavity having 
walls formed by Asp48, Val49, Leu83, Gly218, Gly220, Ser222, Arg257, 
Gly259, Gln262 and the sulfenyl-amide. 

61 . A compound according to claim 60 having a molecular shape and charge 
distribution that enables it to make polar interactions at the third binding site 
with one or more of: 

(3) Arg45 

(29) Asp48 

(30) Ser222 

(31) Arg257 
(14) Gln262 

(33) the protein backbone of one or more of (i) Thr84, (ii) Gly2 1 8, (iii) 
Gly220, (iv) Gly223, (v) Met258, (vi) and Gly259; and 

(34) the sulfenyl-amide residue. 

62. A compound according to claim 61 having a molecular shape and charge 
distribution that enables it to make polar interactions at two or more (more 
preferably three or more, four or more, or five or more) of the residues (3), 
(29) to (31), (14), (33) and (34). 

63. A compound according to any one of claims 60 to 62 having a molecular 
shape and charge distribution that enables it to make hydrophobic 
interactions at the third binding site with one or more of: 
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(35) 


Val49 


(36) 


Leu83 


(37) 


Gln85 


(38) 


Gly86 


(39) 


Gly218 


(40) 


Gly220 


(41) 


Met258 


(42) 


Leu260 and 


(43) 


the main chain of His214. 



A compound according to claim 50 or claim 51 which is a nucleophilic 
ligand, having a nucleophilic group that will react with the sulfenyl amide 
moiety, and a binding region for binding to the sulfenyl amide PTP in the 
region of the sulfenyl amide moiety. 

A compound according to claim 64 wherein the binding region has a 
molecular shape and charge distribution that enables it to make an 
interaction with the first and second binding sites as defined in any one of 
claims 52 to 63. 

A compound according to claim 65 wherein the nucleophilic group contains 
a heteroatom (e.g. selected from nitrogen, sulphur, oxygen and phosphorus) 
that is either neutral or negatively charged, and which is capable of reacting 
with the sulfenyl amide species. 

A compound according to claim 66 wherein the heteroatom is selected from 
nitrogen, oxygen and sulfur nucleophiles. 

A compound according to claim 67 wherein the nucleophilic group is 
selected from the group consisting of a thiol, disulfane, primary thioamide, 
secondary thioamide, primary thiourea, secondary thiourea, primary amine, 
secondary amine, primary hydrazine, secondary hydrazine, primary 
hydrazide, secondary hydrazide, primary hydrazone, secondary hydrazone, 
primary amide, secondary amide, primary urea, secondary urea, primary 



10 64. 



65. 
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sulfonamide, secondary sulfonamide, 5-membered ring heterocycle 
containing NH, alcohol, hydroxylamine, oxime, hydroxamic acid, 
carboxylic acid (preferably other than an oxalamic acid), sulfoxide, sulfate 
and a nitrone. 

5 69. A compound according to claim 67 or claim 68 wherein the nucleophile is 
selected from the group consisting of the nucleophiles set out in Table 3 
below, and L is the residue of the compound. 



Type of 
nucleophile 


Structure 


Name 


Sulphur 


L — on 


Thiol 




L— S-SH 


Disulfane 




l Ah 2 

2 


Primary Thioamide 




l-A ,. 

H 


Secondary Thioamide 




L-N^NH 2 


Primary thiourea 




L— N-^-N— L' 
H H 


Secondary thiourea 


Nitrogen 


L— NH 2 


Primary amine 




L— N— L' 
H 


Secondary amine 
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L-N-NH 2 


Primary Hydrazine 




1 — M — N — I ■ 
H H 


oeconuary xiyuxazinc 




L X m-NH, 
H 


Primary Hydrazide 




,J H 

H «- 


Secondary Hydrazide 




L=N-NH 2 


Primary Hydrazone 




L=N-N— L' 
H 


Secondary Hydrazone 




A 

NH 2 


Primary amide 




H 


Secondary amide 




O 

L— N-^-NH, 
H 


Primary urea 




o 

L— N-^N— L' 

h n 


Secondary urea 
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a 

L-S-NH, 

1 1 

o 


Primary Sulfonamide 




o 

II 

L — S — M — 1 ■ 

o 


Secondary Sulfonamide 




L^V H 


5-membered ring heterocycle 
containing NH 


Oxygen 


L-OH 


Alcohol 




L— N-OH 
H 


Hydroxylamine 




L— N-OH 


Oxime 




N- OH 
H 


Hydroxamic acid 




A 

OH 


Carboxylic acid (preferably 
not oxalamic acids) 




+ 

L— S-O 


Sulfoxide 




O 

II 

L-fl-O 

o 


Sulfate 




+ - 

L=N-0 


Nitrone 
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A compound according to any one of claims 47 to 69 which comprises a 
scaffold formed from one or more optionally substituted carbocyclic or 
heterocyclic ring systems, the ring systems and/or the substituents having 
one or more polar or non-polar moieties for interacting with the first and/or 
second binding sites. 

A compound according to claim 70 wherein the carbocyclic and 
heterocyclic ring systems contain at least one an aromatic ring having from 
5 to 12 ring members, more usually from 5 to 10 ring members. 

A compound according to claim 71 containing a heteroaryl group which is 
a five membered or six membered monocyclic ring or a bicyclic structure 
formed from fused five and six membered rings or two fused six membered 
rings, each ring for example containing up to about four heteroatoms 
typically selected from nitrogen, sulphur and oxygen, preferably up to 3 
heteroatoms, more usually up to 2, for example a single heteroatom. 

A compound according to claim 72 wherein the heteroaryl group is selected 
from the group consisting of pyridyl, pyrrolyl, furanyl, thiophenyl, 
imidazolyl, oxazolyl, oxadiazolyl, oxatriazolyl, isoxazolyl, thiazolyl, 
isothiazolyl, pyrazolyl, pyrazinyl, pyrimidinyl, triazinyl, triazolyl, tetrazolyl, 
quinolinyl, isoquinolinyl, benzfuranyl, benzthiophenyl, chromanyl, 
thiochromanyl, benzimidazolyl, benzoxazolyl, benzisoxazole, benzthiazolyl 
and benzisothiazole, isobenzofuranyl, isoindolyl, indolizinyl, indolinyl, 
isoindolinyl, purinyl (e.g., adenine, guanine), indazolyl, benzodioxolyl, 
chromenyl, isochromenyl, chroman, isochromanyl, benzodioxanyl, 
quinolizinyl, benzoxazinyl, benzodiazinyl, pyridopyridinyl, quinoxalinyl, 
quinazolinyl, cinnolinyl, phthalazinyl, naphthyridinyl and pteridinyl. 

A compound according to claim 71 containing at least one carbocyclic aryl 
group selected from the group consisting of phenyl, naphthyl, indenyl, and 
tetrahydronaphthyl. 
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A compound according to claim 70 containing at least one non-aromatic 
heterocyclic group having from 3 to 12 ring members, more usually 5 to 10 
ring members. 

A compound according to claim 75 wherein the non-aromatic heterocyclic 
group is monocyclic or bicyclic, and is optionally selected from the group 
consisting of cyclic ether moieties (e.g as in tetrahydrofuran and dioxane), 
cyclic thioether moieties (e.g. as in tetrahydrothiophene), cyclic amine 
moieties (e.g. as in pyrrolidine), cyclic sulphones (e.g. as in sulfolane and 
sulfolene)), cyclic sulphoxides, cyclic sulphonamides and combinations 
thereof. 

A compound according to claim 76 wherein the non-aromatic heterocyclic 
group is selected from the group consisting of morpholine, piperidine, 
pyrrolidine, pyrrolidone, tetrahydrofuran, tetrahydrothiophene, dioxan, 
tetrahydropyran, imidazoline, imidazolidinone, oxazoline, thiazoline, 
piperazine, and N-alkyl piperazines such as N-methyl piperazine. 

A compound according to any one of claims 70 to 77 wherein the 
carbocyclic and heterocyclic groups are substituted by one or more 
substituent groups selected from halogen, hydroxy, trifluoromethyl, cyano, 
nitro, carboxy, amino, carbocyclic and heterocyclic groups having from 3 to 
12 ring members; a group R a -R b wherein R a is a bond, O, CO, X*C(X 2 ), 
C(X 2 )XS X^CX^X 1 , S, SO, S0 2 , NR c R d , S0 2 NR c or NR c S0 2 ; and R b is 
selected from hydrogen, carbocyclic and heterocyclic groups having from 3 
to 7 ring members, and a Ci. 8 hydrocarbyl group optionally substituted by 
one or more substituents selected from hydroxy, oxo, halogen, cyano, nitro, 
amino, mono- or di-C M hydrocarbylamino, carbocyclic and heterocyclic 
groups having from 3 to 12 ring members and wherein one or more carbon 
atoms of the Ci. 8 hydrocarbyl group may optionally be replaced by O, S, 
SO, S0 2 , NR C , X'C(X 2 ), C(X 2 )X* or X l C(X 2 )X l ; 

R° and R d are the same or different and each is hydrogen or Cm 
hydrocarbyl; 
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X 1 is O, S or NR° and X 2 is =0, =S or =NR°. 

A pharmaceutical composition comprising a compound as defined in any 
one of claims 47 to 78 and a phannaceuticaily acceptable excipient. 

A compound as defined in any one of claims 47 to 78 for use in medicine, 
for example in the prevention or treatment of a disease state or condition 
mediated by PTP such as PTP1B. 

The use of a compound as defined in anyone of claims 47 to 78 for the 
manufacture of a medicament for the prevention or treatment of a disease 
state or condition mediated by PTP such as PTP IB. 

A method for the prevention or treatment of a disease state or condition 
mediated by PTP such as PTP IB in a patient (e.g. a human patient) in need 
thereof, which method comprises administering to the patient a 
therapeutically effective amount of a compound as defined in any one of 
claim 47 to 78. 

A use, method, or compound for use as defined in any one claims 80 to 82 
wherein the disease state or condition mediated by PTP such as PTP1B is 
selected from cancer, diabetes, rheumatoid arthritis and hypertension. 

A three-dimensional representation of a PTP sulfenyl amide or a portion of 
a PTP sulfenyl amide, which representation comprises all or a portion of the 
coordinates of Table 1 or Table 2 ± root mean square deviation from the Ca 
atoms of not more than 1 .5A. 

The three-dimensional representation of claim 84, which is a model 
constructed from all or a portion of the coordinates of Table 1 or Table 2 ± 
root mean square deviation from the Ca atoms of not more than 1 .5 A. 

The model of claim 85 wherein the portion of PTP sulfenyl amide is a 
binding cavity and the portion of the coordinates of Table 1 or Table 2 ± 
root mean square deviation from the Ca atoms of not more than 1 .5A 
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comprise those of atoms defining a binding site within the binding cavity 
(for example, the "selected coordinates" as defined herein). 

87. A three-dimensional representation of a compound, which fits the model of 
claim 85 or claim 86 ± root mean square deviation from the Ca atoms of not 

5 more than 1.5 A. 

88. The three-dimensional representation of claim 87, which is a model of the 
compound. 

89. The model of claim 88 wherein the compound is a pharmacophore. 

90. The model of any one of claims 85, 86, 88 or 89 which is: (a) a wire-frame 
10 model; (b) a chicken- wire model; (c) a ball-and-stick model; (d) a space- 
filling model; (e) a stick-model; (f) a ribbon model; (g) a snake model; (h) 
an arrow and cylinder model; (i) an electron density map; (j) a molecular 
surface model. 

9 1 . The model of any one of claims 85, 86, 88, 89 or 90 which is in physical 
15 form. 

92. The model of any one of claims 85, 86, 88, 89 or 90 which is in electronic 
form. 

93. The model of claim 92, which comprises a graphical image display on a 
computer screen. 

20 94. A computer-based method for the analysis of the interaction of a molecular 
structure with a PTP sulfenyl amide structure of the invention, which 
comprises: (a) providing a PTP sulfenyl amide model as defined in claim 
85, 86 or 90 to 93; (b) providing a molecular structure to be fitted to said 
PTP sulfenyl amide model; and (c) fitting the molecular structure to the PTP 

25 sulfenyl amide model to produce a compound model as defined in claim 88, 

89 or 90 to 93. 
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